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ABSTRACT 

I give a pedagogical explanation of what it is about quantization that makes 
general relativity go from being a nearly perfect classical theory to a very 
problematic quantum one. I also explain why some quantization of gravity 
is unavoidable, why quantum field theories have divergences, why the diver- 
gences of quantum general relativity are worse than those of the other forces, 
what physicists think this means and what they might do with a consistent 
theory of quantum gravity if they had one. Finally, I discuss the quantum 
gravitational data that have recently become available from cosmology. 
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1 Introduction 



Gravity was the first of tlie fundamental forces to be recognized and it will be 
the last to be understood. Its early recognition came because it is both long 
range and universal; gravity acts over macroscopic distances, and no one has 
ever found a way to screen it. Mankind's problems comprehending quantum 
gravity are the subject of this article, but they derive from two basic facts: 

1. Humans are not good at guessing fundamental principles without guid- 
ance from nature; and 

2. Gravity is such a weak interaction that nature doesn't provide much 
guidance in the quantum regime of microscopic sources. 

We know there is something wrong with perturbative quantum general rel- 
ativity because one cannot consistently absorb the divergences it produces 
without introducing new degrees of freedom that would make the universe 
blow up [Il[2l[3llll[5l[6l[7|. Obtaining that result was the work of great 
physicists over many decades, but their achievement will remain incomplete 
until we can identify the problem and fix it. 

I wish I could present a solution to quantum gravity, or at least a program 
that could reasonably be expected to provide a solution. Because that is not 
possible I shall endeavor to instead give a clear explanation of the problem. 
This entails answering a number of questions: 

• What is the distinction between classical physics and quantum physics 
that makes general relativity give such a wonderful classical theory of 
gravity and such a problematic quantum one? 

• Why do we have to quantize gravity? 

• Why do quantum field theories have divergences? 

• Why are the divergences of quantum general relativity worse than those 
of the other forces? 

• How bad is the problem? 

• What are the main schools of thought about quantizing gravity and 
why do they disagree? 

• What would we do with the theory of quantum gravity if we had it? 
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I will also comment on the quantum gravitational data that has recently 
become available. 

The tale 1 have to tell is of necessity a complex one, requiring many digres- 
sive explanations. However, there is no need for the exposition to transcend 
the knowledge one expects of any physics graduate student. Because every 
one of the basic issues behind the problem of quantum gravity has a coun- 
terpart in either electrodynamics or introductory quantum mechanics, I shall 
use those subjects as paradigms. This is not condescension; even experts can 
benefit from occasionally viewing a tough problem in a general way, without 
becoming lost in technical details. 

I shall work in four spacetime dimensions, with coordinate points labeled 
thus, x'^ = {ct,x). A symbol with an arrow over it denotes a 3- vector, and 

1 employ the usual notations for scalar and vector products. Differentiation 
with respect to time is denoted with a dot; the gradient operator is V. To 
be clear, and for future reference. Maxwell's equations in MKS units are, 

€oV-E = p , V-5 = 0, (1) 
—V X B-eoE^ J , V X E + B ^0 . (2) 

— * — * 

Here the various fields are: electric, E(t,x); magnetic, B{t,x); the charge 
density, p{t,x); and the current density, J{t,x). The two constants are the 
electric permittivity of free space, eg, and the magnetic permeability of free 
space, Pq. 1 denote spatial Fourier transforms with a tilde, 

f{t,k) = Jd'xe-'^-^f{t,x) ^ f{t,x) = J-0-^e^^'-'f{t,k) . (3) 

Finally, I use the term "classical" to mean "not quantum," irrespective of 
relativity. So it is perfectly valid to speak of "classical general relativity." 
The adjective for suspending relativity is "nonrelativistic." 

2 Perturbative Quantum General Relativity 

The central problem of quantum gravity is that the computational techniques 
used with great success for the other forces do not give consistent results 
when applied to quantum general relativity. To understand the problem 
I will have to explain a little bit about general relativity, what it means 
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to quantize a theory, and the only technique we so far have for computing 
things in quantum field theory. Then I describe renormalization from the 
perspective of polarization in electrodynamics, and I explain why the only 
way of consistently renormalizing perturbative quantum general relativity 
would make the universe virulently unstable. The section closes with a brief 
discussion of fixes that don't work. 

2.1 General Relativity 

One defines a physical theory by specifying three things: 

• The dynamical variable; 

• How the dynamical variable affects the rest of physics; and 

• How the rest of physics affects the dynamical variable. 

For example, the fundamental dynamical variables of electromagnetism are 
the scalar potential x) and the vector potential A{t, x), whose derivatives 
give the electric and magnetic fields, 

E = -V$ -A and B = V x A. (4) 

They affect a particle of charge q which has position x and velocity v through 
the Lorentz force, 

F = qE{t, x)+qvx B{t, x) . (5) 

And the various charges and currents affect them through the Maxwell equa- 
tions which have the general structure, 

d\<l>,A) = {p,j). (6) 

My notation is that d'^{^,A) stands for the derivative with respect to any 
two coordinates of any combination of $ and A. For the purposes of this 
article no greater specificity is required than to note: 

1. The electrodynamic field equations are linear in the potentials; 

2. The electrodynamic field equations involve second derivatives of the 
potentials; and 
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3. The electrodynamic field equations are sourced by the charge density 
p(t,x) and the current density J(t,x). 

In these terms, the dynamical variable of general relativity is known as the 
metric field, g^u{t, x). It is a 4 x 4, symmetric matrix whose components are 
functions of space and time. It affects the rest of physics by controlling the 
physical distances and times between points. Recall that in special relativity 
the frame independent concept of the distance between nearby points x^ and 
x^ + dx^ is given by the "invariant interval" , 

{dS^) special = -C'de + dX-dX. (7) 

relativity 

The invariant interval of general relativity is, 

{ds') = ^ E 9 x)dx^dx^ . (8) 

relativity ^—Q iy=0 

The fact that the metric defines true distances and times affects how deriva- 
tives and integrals are constructed in a way that is analogous to coupling 
rule of electrodynamics. The details need not concern us. 

The rest of physics affects the metric through the Einstein equations 
which have the general form, 

A{g)d^g + B{g)dgdg + Ag 

My notation is that d^g stands for the derivative of any component of the 
metric tensor with respect to any two coordinates. In the same sense, dgdg 
stands for any product of first derivatives of the metric. The parameter A is 
known as the cosmological constant. It is often set to zero in general relativity 
but retaining it will facilitate our eventual discussion of renormalization, and 
nature seems to have chosen a small nonzero value for it [3 [9]. At the level 
I propose to work one need only note: 

1. The gravitational field equations are not linear in the metric; 

2. The gravitational field equations involve up to second derivatives of the 
metric; and 

3. The gravitational field equations are sourced by the stress-energy tensor 



ll.V 
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The stress-energy tensor T^,^ has the units of energy per volume (which 
is the same as force per area, or stress) and its various components have the 
meanings, 



^00 

T- ■ 



energy density , (10) 

momentum density in the i — th direction , (11) 

energy flux in the j— th direction , (12) 

i — th component of momentum flux in the j— th direction. (13) 



(The "flux" of any quantity in the j-th direction is the amount of that quan- 
tity which passes through a unit area perpendicular to the j-th direction, 
per unit time.) The stress-energy tensor is composed of the other fields in 
physics, and it also depends on the metric. For example, if we set the metric 
equal to the value it has in special relativity (that is, goo = —1, goi = and 
9ij — the energy density contributed by electromagnetism is, 

Tooit, x)] =^eo[E-E + c'B ■ b] . (14) 

space 

Of course gravity cannot be absent if there are electrodynamic (or any other) 
fields present so the actual expression for Too depends upon the metric in a 
manner that need not concern us. It is worth commenting that the non- 
linearity of the gravitational field equations, and the fact that their sources 
depend upon the metric they help to determine, means that no general solu- 
tion of the Einstein equations is known. This is why physicists resort to the 
perturbative approximation technique I will describe in subsection 2.3. 

It turns out that all fundamental force fields have a triune nature: one 
part is completely arbitrary, another part is totally determined by the sources 
of the force, and the final part consists of independent degrees of freedom. For 
example, the completely arbitrary part of electromagnetism is the ability to 
change the scalar potential and vector potentials by a gauge transformation, 

$(t,f) — > ^{t,x) +e{t,x) , (15) 
A{t,x) — > A{t,x) -Ve{t,x) , (16) 

where 6{t, x) is an arbitrary function of space and time. One can easily 
check that the transformation fllStllGp makes no change in the electric and 
magnetic fields (jlj). One can also show that it makes Maxwell's equations 
([HH]) consistent with current conservation. 
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The other two parts of any force field are illustrated by solving for 
the magnetic field, assuming the current density is some known function, 

f .^.^ /•* , sin[cA;(t-t')] -r A 

B{t,x)= e"'^ dt' — ^-^ '-^ik X J{t',k) 

J (ztt)'^ jo ckeo 

+ /(0ie*"^|^o(^) cos(cH) - ^ kx^oik) sin(cH)|. (17) 

The first line of ( ITTj) give the part of the magnetic field which depends upon its 
source, which is the current density. This part of the field contains important 
physics, but it isn't an independent degree of freedom because knowing the 
current density J{t, x) fixes it completely. The technical name for such a 
field is "constrained". The "dynamical" part of B{t,x) comes on the final 
line of f|T7j) which contains the independent, purely electromagnetic degrees 
of freedom that would be present even if the current density was zero for 
all time. These extra degrees of freedom consist of pure electromagnetic 
radiation that corresponds to the "photons" of quantum electrodynamics. 

Gravity has the same general structure as electromagnetism and all the 
other fundamental force fields: part of the metric can be changed arbitrar- 
ily by a symmetry transformation known as general coordinate invariance; 
another part of the metric is determined by the stress-energy tensor; and a 
third part contains independent degrees of freedom, the pure gravitational 
radiation which comprises the gravitons of quantum gravity. Because the 
Einstein equations ([9]) are not linear, the breakup of g^uit,x) into these 
three constituents is vastly more complex than for electromagnetism, but 
that is another detail which need not concern us here, although it is a major 
headache for people who work in gravity. A fact of great importance for the 
discussion of renormalization is that general coordinate invariance implies 
the left hand side of the Einstein equation ([^j is the unique combination of 
the metric and no more than two of its derivatives which is consistent with 
the conservation of stress-energy. 

To understand quantum field theory it is important to be clear about the 
distinction between the constrained and dynamical parts of a force field. It 
is the constrained part of electrodynamics which holds the Hydrogen atom 
together. For example, if q{t) and p{t) are the position and momentum of 
the electron then the Hamiltonian for nonrelativistic Hydrogen is, 

ff=M!_,4(,T ,here *® = ■ (18) 
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Note that the scalar potential $(g) is a completely determined function of 
the electron's position q{t). If we were doing quantum mechanics it would be 
a quantum operator, but only because the electron's position is a quantum 
operator; it would not possess any independent quantum degrees of freedom 
of its own. Of course the nonrelativistic Hamiltonian f|T8l) is only an approx- 
imation and the full, relativistic system does incorporate quantized photon 
degrees of freedom. These degrees of freedom are needed to explain the Lamb 
shift of about one part in 10^ in the frequencies of light emitted from decays 
of the ^5*1 and ^Pi states [10]. However, it is worth emphasizing that the 
basic structure of tiie atom is determined by the constrained part of the elec- 
trodynamic potentials and one has to work quite hard to see even the first 
evidence for quantized photons from it. For example, the Lamb shift was 
only detected by stimulating a transition between the two levels. 

Now consider the vastly weaker gravitational force. We have so far not 
been able to directly detect gravitational radiation, much less the gravita- 
tional radiation from a quantum transition, or the even subtler shift due to 
quantized gravitons. The gravitational effects which hold the solar system 
together derive from the constrained part of metric. There is only indirect 
evidence that gravitational radiation exists pjj, and there is no evidence at 
all for its quantization. Which brings me to one of the major points of this 
article: some quantization of gravity is inevitable because part of the metric 
depends upon the other fields whose quantum nature has been well established. 
It turns out that the first divergences of quantum gravity are due to quantum 
effects from these other fields |2l |3l H]. The quantum effects of gravitons — 
if there is gravitational radiation, and if it is quantized — do cause problems 
[6|,[7j, but these difficulties occur at higher order in the approximation scheme 
described in section 2.3. So the central problem of quantum general relativity 
has nothing to do with gravitons. 

2.2 Quantum Mechanics 

It will be noted that I have written field equations such as and even 

solved them (fT7|) . without stating whether the system is classical or quantum. 
There is a good reason for this: it doesn't matter! The operator equations of 
motion in the Heisenberg Picture of quantum mechanics are the same as those 
for the corresponding classical theory. Further, "solving" these equations 
means precisely the same thing: one expresses the dynamical variables at 
any time in terms of the initial values of the dynamical variables. Those 
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initial values are the fundamental degrees of freedom of the system, and 
the only difference between classical physics and quantum physics is what 
they represent. In classical physics the initial values are just numbers and 
each of them can take any value, whereas in quantum physics they are non- 
commuting operators which must obey the Uncertainty Principle. 

An example which has great significance for our discussion is the simple 
harmonic oscillator. The dynamical variable is the position as a function of 
time, q{t). For the moment we imagine this system to exist in isolation, so 
it has no effect on the rest of the universe. Nor does the rest of the universe 
affect it; its dynamics are controlled by its Lagrangian and Euler-Lagrange 
equations. 



2 2 

-mu q 
2 ^ 



dq 



d^dL 
dt dq 



— — —m 



q{t)+u^q{t) =0. (19) 



The general solution in the sense of fundamental theory is, 

?(^) — Qo cos{u}t) + — sin{u}t) , 

where qo = q{0) and % = q{0). 

Now break up the trigonometric functions using the Euler relation. 



(20) 



COS[U!t) — - 



sin(u}t) — - 



(21) 



The solution can be expressed as a sum of positive and negative frequencies. 



go H 

u ■ 



iuit 



qo 

u ■ 



(22) 



As noted above, this same solution applies both for the classical theory and 
for the quantum one. In the latter we can recognize that the operator coeffi- 
cient of e~"^* must lower the energy by hcu, whereas the operator coefficient 
of e*'^* must raise it by the same amount. The canonical momentum tells us 
how qo and commute. 



- ^ 
^ dq 



mq 



go, go 



ih 
m 



(23) 



We can use this to canonically normalize the raising and lowering operators, 

a, a^]=l. (24) 



muj 
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And the final result for the position operator takes the form, 

q{t) = ae{t) + e*{t) , 
where the "mode coordinates" are, 



a = 



mu 



2h 



(go + — ) 
V a; / 



and the "mode functions" are, 



e{t) 



-iuit 



2muj 



and 



and 



~2h 



UJ 



e\t) ^ 



I h 

2muj 



(25) 



(26) 



(27) 



The harmonic oscillator is so important because of an amazing fact: the 
spatial Fourier components of every quantum field degenerate to independent 
harmonic oscillators in the limit that interactions vanish. For example, if 
we turn off the current density in (fT7|l then the spatial Fourier transform of 

the magnetic field vector B{t, k) behaves as a pair of independent harmonic 
oscillators for each wave vector k. The resulting free field mode sum can be 
rendered thus, 

B{t,x) = f ^ ^ lik X e(t,x; k, X) a{k, X) — ik x e*(t,x; k, A~) a\k, A)|. 

(28) 

In this expression the canonically normalized mode coordinates are, 
a{k,X) = i\l . . 6 — i) 



Ahck 



a\k, X) = —i 



eo 



Ahck 

The associated mode functions are, 



crxBo{k) - Eo{k) 



9+iX4> ■ crxBo{-k) - Eo{-k) 



e(t, x; k, X^ 



Ae^ck 



e + iX(j) 



ikct+ik-x 



(29) 
(30) 

(31) 



And it should be noted that I have expressed the wave number k in spherical 
coordinates with the usual spherical unit vectors, 

(32) 
(33) 
(34) 



f = (^sin(6') cos(0), sin(6') sin(0), cos(6') 
9 = (^cos(^^) cos(0), cos(6') sin(0), — sin(6') 
— sin(0), cos(0), 
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In quantum electrodynamics acting a^(k,X) on a state adds a photon 

— * 

with energy E = hkc, 3-momentum p = hk and polarization A. (A = +1 
stands for left-handed, circular polarization and A = — 1 is right-handed.) 
Acting a{k, A) would remove a photon with the same quantum numbers. It 
should be noted that the "particles" of fundamental theory are always the 
Fourier modes of quantum fields. For example, the electrons and positrons 
of quantum electrodynamics are represented by terms in the free field mode 
sum for the electron field, 

f d?k { -►-> ^ ->! 

*j(i,f) y^^-p ^ |£i(t,f;A;,s)6(A;,s) -F£i(t,f;A-,s) c^(A;,s)|. (35) 



Acting c\k, s) on a state adds a positron of energy E — ^ {mc^y + {Hcky , 3- 

— * 

momentum p — hk and z-component spin (in its rest frame) s — Acting 
b{k, s) on a state removes an electron with the same 4-momcntum and spin. 
As always, the mode coordinates are simply linear combinations of the initial 
values of the dynamical variable, which are the true degrees of freedom of 
system. 



b{k,s) 




i=l 




1=1 



J2u*{k,smO,k) , c\k,s)^J—J2v*{k,s)^.iO,k). 



(36) 

(37) 
(38) 



he 



Here the spinor wave functions arc, 
u{k,s) = 

v{k,s) = 



2{E+mc'^) 

I he 



E+rn& 
Tic 
+m< 

he 



2(£;+mc2) 



he 

he 




and the various 2-component quantities are familiar from nonrelativistic 
quantum mechanics. 



1 
1 



a 



1 

1 



a 







, (39) 
.(40) 
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The mode functions in fl35p are, 



(41) 

At this point I must discuss a httle about quantum states. Operators 
such as go and go have the potential for being anything; it is the state wave 
function which describes how they are distributed. These wave functions 
are time independent in the Heisenberg Picture of quantum mechanics; it is 
the operators which evolve in time. In the position representation we can 
write the wave function as ip{x) and the two fundamental operators act by 
multiplication and differentiation, 

qQi){x) = xip^x) , (42) 

qoH^) = -^^'(^)- (43) 

The inner product between any two states is defined by integration, 



Utp)= / dx(j)*{x)4j{x) . (44) 
And a very important property of any state is normalization, 

= j dx^*{x)^{x) = 1 . (45) 



(This is what puts the "quantum" in quantum mechanics.) The expectation 
value of any operator O{qo, in the state \ip) is, 

/ / \ \ / if) 3 \ 
(V'0(go,go)k)= / dxr{x)0{x, ^)Hx). (46) 

Of course expression (146!) suffices for time dependent operators such as q(t) 
because they can be expressed in terms of go and go. 

The notions of state wave functions, and inner products involving them, 
all have straightforward generalizations to quantum field theory (if one is 
good with functional calculus!) however, they are very seldom used. The 
reason for this is that quantum field theories possess an infinite number of 
mode coordinates, one or more for every wave vector k. Only a finite number 
of these modes can be excited because it costs energy to excite a mode and 
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there is only a limited amount of free energy available. Hence there are so 
many more ground state modes than excited ones that most quantum field 
theoretic effects derive from the vast number of modes in their ground states. 

The Uncertainty Principle provides a powerful, intuitive way of using 
classical physics to understand the effects of modes which are in their ground 
states. When the expectation values of Qq and go ^-re zero we can express the 
Uncertainty Principle as follows, 

mim ■ mim > ^ ■ (47) 

Equality is achieved for a Minimum Uncertainty state. For such a state we 
can think of the Hamiltonian as a function of Qq alone, 

H = ^mql + ^muj^l + ]^mw''ql = E{qo) . (48) 

The term fi^ /SmqQ in E{qo) is known as Uncertainty Pressure. It reflects 
the physical import of the Uncertainty Principle, which is that concentrating 
go more tightly about its mean value of zero makes go proportionately less 
concentrated. With Uncertainty Pressure we can understand many quantum 
effects classically. For example, the minimum energy is. 



dE 2 / h ^ ^ u 

— = 3 + mu; go =^ gmin = \ and E^^^ = -Tiw . 

oqo 4mgo V 2ma; 2 

(49) 

One doesn't always get the factors of two right with this level of analysis but 
it is a powerful technique for understanding complex things in a simple way, 
and I will apply it to explain where the divergences of quantum gravity arise 
and why they are worse than those associated with the other forces. 



2.3 Perturbation Theory 

The alert reader will have noted that the exact solutions of the previous 
subsections were all obtained for noninteracting theories. There is a good 
reason for that: not a single interacting field theory has been solved in four 
spacetime dimensions. Note the essential distinction between "exactly solv- 
ing a theory", which means expressing the dynamical variable at any time 
in term of arbitrary initial value data, and obtaining an "exact solution to 
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the field equations," which means solving the equations of motion for one 
particular choice of initial values. The various "exact solutions" of Einstein's 
equations involve setting almost all the degrees of freedom to zero. This is 
fine for classical physics, but it is not permitted in quantum mechanics. For 
example, in free quantum electrodynamics the mode coordinates a{k, A) and 
a^{k, A) of expression fl28l) are not commuting variables, 

a{k, A), a\k', A')] = 6xyi2n)^S^{k-k') . (50) 

A classical picture is possible, but one must imagine that each mode experi- 
ences the 0-point motion we found above for the harmonic oscillator. 

Because there is no hope of exactly solving the field equations for arbitrary 
initial value data, computing anything in quantum field theory requires the 
use of approximation techniques. The standard one is known as perturbation 
theory and a good way of motivating it is by observing that, even though 
our expression (ITTI) is correct, it doesn't give the magnetic field because the 
current density J(t, x) is affected by the electrodynamic potentials. One can 
see this in quantum electrodynamics for which the charge density and the 
current density are formed from the electron field \I^j(t,x), 



p{t,x) = ^^\t,x)^{t,x) , J{t,x) = j^\t,x)(^^ l)'^{t,x) 

And the Dirac equation for \1/ involves the potentials $ and A 
d ze$i ^ /—a 0\ ieA\ ^ imc / I 



(51) 



dct he 



(«^) 

One can see what must happen, even without working out the details: 

• 0-point motions of \1/ modes engender $ and A; 

• These electromagnetic fields change 

• Which changes $ and A; 

• Which changes and so on. 

The progression is endless, and we could just have easily begun it with 0-point 
motions of the electromagnetic fields affecting The process could be made 
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to terminate if an external force fixed or if we made some very special 
choice of initial values. But there are no external forces in fundamental 
theory, and quantum mechanics demands that we consider generic initial 
value data. 

The cycle of action and reaction I have sketched precludes obtaining the 
exact solution, but a good approximate solution can be found if each cycle 
of action and reaction induces a smaller subsequent cycle. So we start with 
noninteracting, charged matter and electrodynamics, which defines the 0th 
order. In quantum electrodynamics the 0th order would be the free field mode 
sums (!35l) and the analogue of (l28l) for the vector and scalar potentials. The 
1st order perturbation for the electrodynamic potentials comes from solving 
Maxwell equations with the charge density and current density formed 
from the fixed, 0th order electron field fl35l) . The 1st order perturbation for 
the electron field comes solving the Dirac equation fl52|) with the terms 
involving the electron charge e evaluated at 0th order. 



The second order perturbation is obtained by computing what the first order 
fields do, and so on. If the effect of each cycle is reduced by a small enough 
factor then we do not need to carry the process out for many cycles before the 
theoretical prediction is as accurate as any experiment that can be performed. 
This is how perturbation theory works. 

It should be clear from the sketch I have given that each order of quan- 
tum electrodynamic perturbation theory comes with an extra power of the 
electron charge e. It should also be clear that the effects of different lower 
order modes add in perturbation theory. In a fixed volume V one sums over 
the Fourier wave vectors k using the famous density of states formula. 



Because we typically want densities, the factor of V drops out. Because there 
are as many positron modes as electron modes, with the same kinematic 
properties and opposite charge, it turns out that the odd powers of e cancel 
out and the net effect comes from even powers. It also happens that one 
always gets a certain number of 27r's from fl5^ . and some factors of eo, h and 




(53) 




(54) 
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c in the dimensionless combination, 

^ ^ ^ J- f55) 
An'^eohc tt 430 ' ^ ^ 

This is the expansion parameter of quantum electrodynamics and its small- 
ness is what makes perturbation theory effective. 

I have glossed over quite a number of details in describing how perturba- 
tion theory works for quantum electrodynamics, and I am not going to be 
any more thorough for quantum general relativity. However, the coupling 
constant is important because it controls how reliable perturbation theory 
ought to be. One can see from the Einstein equation ([9]) that the source 
terms involve G/c^, so one more power of this must obviously appear at each 
successive order in perturbation theory. Each higher order in perturbation 
theory also contributes a factor of 1/hc, just as in quantum electrodynamics 
(and all other quantum field theories). The resulting combination of funda- 
mental constants has the dimension of an inverse energy squared, 

G / 1 X2 / 1 

(56) 



hc> V2.0xl09J^ Vi.2x 1019 GeV 

Therefore quantum gravitational perturbation theory must amount to an 
expansion in the dimensionless parameter GE'^/hc', where E is some energy 
in whatever process is under study. 

An issue which sometimes confuses people is that the series approximation 
perturbation theory generates is only asymptotic, rather than convergent. 
Suppose we have function S{x) and we obtain an A^-term series approxi- 
mation Sn{x) in terms of some standard set of functions fn{x) which are 
organized so that, for the a;-range of interest, 

foix) > fi{x) > f2{x) > fs{x) > ... . (57) 

The typical case is that fn{x) = but 1 want to allow for fractional powers 
or powers times logarithms. The series approximation Sn{x) takes the form, 

N-l 

Swix) = Snfn{x) , (58) 

n=0 

where the s„ are numbers. We say that the series is convergent if taking 
to infinity recovers the original function S'(x), 

Convergent: =^ lim S{x) — Sn{x) =0. (59) 
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We say the series is asymptotic ai x = Xq if the difference between S{x) and 
Siy{x) goes to zero faster than fN-i{x) as x approaches Xq, for any fixed A^, 



Asymptotic : 



hm 

x—>xo 



S{x)-Sn{x) 

fN-l{x) 



. 



(60) 



It is entirely possible for a series to possess both properties but it is a fact 
that perturbative expansions of quantum field theory are only asymptotic 
(for zero coupling constant), not convergent. 

The difference between a convergent series and an asymptotic series is 
beautifully illustrated by a special function known as the "Exponential Inte- 
gral" [12], 



Ei(x) 



oo g~ 

dt- 



t 



(61) 



A convergent series for small x can be obtained by extracting the integral 
down to constant, and then adding zero in the form of — ln(x) + 

ln(a;). 



/•oo g ^ r 
E,ix) = dt — + 

Ji t Jo 



dt 



t 



ln(x) 



dt 



re 



■1 



t 



(62) 



The first two integrals of (l62l) just give minus the Euler-Mascheroni constant 
7 ~ .577. The final integral of (!62l) can be evaluated by expanding (e~*— l)/t 
in powers of t and then integrating termwise. The resulting expansion is. 



Ei(a;) = -7 - ln(a;) - ^ 



n=l 



n ■ n\ 



(63) 



Of course this series is also asymptotic for x ^ 0. 

One can get a asymptotic expansion of Ei(x) for large x by successive 
partial integrations. 



Ei(a;) 



N-l 

E 

n=0 



n\ e 



X 



n+l 



oo g 

dt 



(64) 



The asymptotic series (for x oo) is the first term of flM|) . the second term 
is the remainder. Its magnitude can be bounded by replacing the 1/t^^^ 
with l/x^+\ 



Rn = N\ 



oo g 

dt 



< 



X 



N+l 



dt e 



-t 



A^!e-^ 



(65) 
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Hence the magnitude of the difference between Ei(x) and S^^x) is, 



EAx) 



-y 



(-l)"n! e"^ 



n=0 



-X 



< ^ ■ (66) 



This proves the series is asymptotic as x — > cxd. To see that it is not conver- 
gent, note that the sum of Rn and Rn+i can be evaluated exactly. 



°° a r e"* 1 A^! e" 



Rn + Rn+1 = -m I dt i: = . (67) 



X 



dt 



Increasing by one changes the right hand side of fl67|) by a factor of (A^+ 
l)/x. As long as this is less than one, accuracy is increased, but for any fixed 
value of X there must eventually be an A^ past which accuracy deteriorates. 
The best asymptotic series approximation for Ei(x) is therefore obtained by 
carrying the expansion out to about N ^ 1/x and no further. 

The fact that nonconvergent asymptotic series expansions cannot be made 
arbitrarily accurate bothers the mathematically inclined. It does preclude 
defining a quantum field theory by its perturbative expansion, however, 
whether or not there is any practical problem depends upon the value of 
the coupling constant and the state of the experimental art. If the coupling 
constant is so small that no experiment can measure the deviation between 
reality and the best asymptotic series result then there is no operational 
problem. That is the case for quantum electrodynamics at conventional en- 
ergy scales. A process which involves 2N vertices will acquire a factor of 
(a/27r)^ from its coupling constants and the factors of 2 and tt from momen- 
tum integrations. There will also be a multiplicity factor of about (2A^— 1)!!, 
so one expects the series to begin diverging at about A^ ~ vr/a ~ 430. At 
that point the accuracy is so great one can only estimate it using Stirling's 
approximation, 

Q"x(2iV-l)!!^y2(^)"^10-. (68) 



No experiment can approach that accuracy; the current state of the art is 
sensitive to about the fourth order in a, not the four hundredth}. 

On the other hand, the coupling constant of the strong interaction is 
large enough, at low energies, that the asymptotic expansion is practically 
worthless. So how about quantum general relativity? We have seen that 
perturbation theory generates an expansion in powers of GE'^/hc'. For the 
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highest proton energy we shall be able to reach at the LHC this number 
would be about, 



GE^ / 7x 103 GeV x2 ^ ^ ^ 

~ tttt; — ~ 3.4 X 10 . (69) 

he' V 1.2 X 1019 GeV^ ^ ' 

This means the perturbative series for quantum general relativity should be 
wonderfully more accurate than that of quantum electrodynamics, for which 
analogous factor is a/27r ~ 1.2 x 10"^. In fact it should be so good that 
some special circumstance would be needed to make quantum gravitational 
effects observable at all. I will have more to say about this later but let us 
for now note that the asymptotic nature of perturbative results is not high 
on the list of problems for quantum gravity. 



2.4 Renormalization 

It turns out that the perturbative corrections from any 4-dimensional quan- 
tum field theory diverge when they are expressed in terms of the parameters 
such as e and m which appear in the field equations. The reason for this 
divergence is very simple: all the modes contribute a little bit, and there are 
so many modes that one gets a divergence from the sum fl5^ over them. For 
all the other quantum field theories these divergences can be absorbed by 
regarding the parameters of the field equations to depend in a divergent way 
upon physically measured quantities such as the electron charge and mass. 
When that is done correctly the perturbative corrections really are small, if 
the coupling constant is, and they agree wonderfully with experiment. 

The procedure for using the parameters of the field equations to absorb 
divergences is known as renormalization. I will describe how it works by 
showing that the vacuum polarization of quantum electrodynamics is com- 
pletely analogous to the phenomenon of classical polarization in a medium. 
Then I will use the fact that gravity couples to stress-energy, rather than 
charge, to show that renormalizing the divergences of quantum gravity in- 
volves adding new sorts of terms to the gravitational field equations. These 
new terms would remove all the divergences [S], but we will see in the next 
subsection that they would also make the universe blow up. That is the 
central problem of perturbative quantum general relativity. 

Let us consider the phenomenon of polarization in a static, classical 
medium. The actual charge density of the medium consists of an enormous 
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number of positive and negative point charges located at equilibrium po- 



sitions X„ 



p{x) 



undisturbed 
medium 



(70) 



The medium as a whole is electrically neutral; it is only microscopically that 
one can see its vast collection of positive and negative charges. If we apply 
a static electric field E{x), the charges shift to new equilibrium positions 
Xa — > Xa + Ax a- Wc cau expand the density of each charge around its 
equilibrium value, 



qaS^{x — Xa—AXa^ = qaS^{x — Xa) — V ■ ^q, AXq,5^ (f — Xq, 



+ 



(71) 



Hence we can write the charge density of the disturbed medium as its undis- 
turbed value plus a series of corrections, 



p{x] 



disturbed 
medium 



p{x) 



undisturbed 
medium 



-V- 



"^qaAXaS^ix-Xa 



+ 



(72) 



We have already noted that the undisturbed charge density appears to be 
zero on macroscopic scales because the positive and negative charges cancel 
one another. However, the sum in the square bracketed term on the right 
hand side of fl72l) tends to give a coherent effect because the positive charges 
move with the applied electric field and the negative charges move the other 
way. The higher terms in the expansion tend to be small because the charges 
don't move much, so the result is. 



p{x] 



H' t ,- -V ■ P(x) , 

disturbed V / * 



where the polarization is. 



(73) 



(74) 



Now suppose we add a small number of "free" charges to the vast collec- 
tion of "bound" ones in the medium. The Gauss law equation reads. 



eoV ■ E ~ Pfree - V ■ P . 



(75) 



The smart way to solve this equation is to combine P with eoE to form the 
electric displacement D, 



(76) 
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For the case of a hnear, isotropic medium the polarization is proportional to 
the electric field, 

P{x) = Aex E{x) . (77) 

In that case we can subsume the effect of the medium into a change of the 
electric permittivity, 

eV ■ E = pfree where e = eo + Ae . (78) 

Let us now switch from classical physics with a medium to quantum 
electrodynamics in empty space. Of course the space can never really be 
empty because it is pervaded by the electron field '^i{t,x), which gives the 
charge density of quantum electrodynamics, 

p(t,x) = |^*(t,x)^,(t,x). (79) 

As we saw in expression fl35l) . the electron field consists of a an infinite 
collection of the operators which create and destroy charged particles. We 
also saw that it is possible to think about this sum of operators classically 
provided one imagines each mode to be executing 0-point motion. When an 
electric field is applied these 0-point motions change, and that produces an 
observable, coherent effect, just as it does for the classical medium and for 
the same reasons. 

From the free field expansion (135|) one can see that the spacetime depen- 
dence of 0-point motion is characterized by mode functions fl4ip . The part 
of interest to us is the oscillatory factor on the electron creation operator 
b^{k,s) in 'if*{t,x), 



e 



iEt/h-ik-x ^j^g^g ^ _ J(mc2)2 + {hcky . (80) 



We might cancel the spatial phase by combining this with a positron creation 
operator of opposite momentum and spin in \E'(t, x). 



e 



lEt/h—ik-x 



■%\k, s) X e'^*/'*+''=-V(-fc, -s) , (81) 



but nothing can be done about the temporal phase factor. This temporal 
phase factor means that effects from this "virtual" electron-positron pair 
cannot remain coherent longer than about At ~ h/E. This is a very short 
time; the longest lived mode is the one with /c = 0, 

n 10-3^ J-s , , 

■ 10-^^ s . (82) 



fc=o mc2 (9 X 10-31 kg) (3 x 10^ m/s) 
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Let us compute the polarization induced by a virtual pair of wave number 
k. As we have seen, quantum physics tells us they effectively exist for a time 
At ~ h/E, but the rest of the analysis is completely classical. The equation 
of motion for a charge e acted upon by an electric field E is, 



dt 



mv 



eE . 



(83) 



Over an interval as short as (j82l) we can regard the energy of the charge as 
constant, 

m E 



1— f^/c^ 



(84) 



We can also forget about the variation in E{x) as the particle moves, so the 
induced deviation in time At = h/E is, 



Ax 



ec^At' 
2E 



2^2 



E 



eh c 



E . 



(85) 



It remains only to add the electron and positron contributions, and then 
sum over modes to find the total induced polarization, 



P(x) 



(2^ 



2^3 



(86) 



Expression fl86l) is proportional to the electric field, just like a linear, isotropic 
medium, so we can identify the change in the permittivity as. 



Ae 



e^h^^ 
27r^hc 



d^k 



1 



(27r)3 [{mc'^y + {hcky]l 
{hckf 



d{hck) 



(87) 
(88) 



Our expression for Ae diverges logarithmically! On the other hand, this 
effect is ubiquitous because it comes from vacuum fluctuations, without any 
medium being present. That means we will never observe Ae independently 
of eo, only their sum0 

e = eo + Ae . (89) 



"'^ Readers familiar with quantum field theory will recognize that e is proportional to Z2, 
the fermion field strength renormalization. 
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It is this sum which must be finite, not Ae or eo separately. So what par- 
ticle theorists do is to adjust the parameter in the field equation cq to be 
conventional value of about 8.85 x 10"^^ F/m minus Ae. That is how renor- 
malization works. 

Some people find it disconcerting to have a parameter from the field 
equations changed when it appears in physical predictions. However, much 
of this sense of wrongness derives from insufficient experience with nonlin- 
ear systems. In a linear system, such as electrodynamics becomes when its 
sources are held fixed, there is indeed a simple relation between parameters 
in the equations and physical predictions. For example, a stationary point 
charge q induces a Coulomb field, 

p{t,x) = q6%x) =^ E{t,x)= . (90) 

The parameters q and eo that enter the classical field equations are the same 
ones which appear in the observed long range field. But even electrodynamics 
becomes nonlinear when one permits its sources to respond to electromag- 
netic fields, and this generally causes the observed quantities to differ from 
their cognates in the equations. For example, the conduction electrons in a 
metal behave, for many purposes, as if they are free to move inside the metal, 
but with a charge and mass different from their values in empty space. 

Renormalization is the hallmark of nonlinear systems, even classical ones. 
We encountered it in quantum electrodynamics because that theory is nonlin- 
ear, not because it is quantum mechanical. Similar effects occur in nonlinear 
classical systems. For example, the combined mass of the Earth-Moon sys- 
tem is a little less than the sum of their masses in isolation, owing to their 
gravitational interaction energy, 

GMeMm (7x10-11 ^)(6x10^^ kg) (7 x 10^^ kg) 
Remc^ ^ (4x 108 m)(3 X 108 si)2 ~ x g. 

(91) 

Nor is the Earth's mass equal to the sum of the masses of its constituents. 
If we imagine it to be a uniform sphere the actual mass is less by about. 



3GM| 3(7 X 10-11 ^)(6x lO^^kg; 



2 



5i?£;c2 5(6x 106 m)(3 X 108 f)2 ^ 3 X 1015 kg. (92) 

This "gravitational renormalization" effect obviously becomes stronger the 
more compact the mass is. Arnowitt, Deser and Misner have shown the 
renormalization is actually infinite for a point mass |13j . 
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Some people are resigned to renormalization in principle, but disturbed 
by the fact that quantum field theoretic renormalizations involves divergent 
quantities. This bothered even the physicists who devised renormalization! 
They eventually accepted it for two reasons: 

• As I have emphasized, renormalization is inevitable in nonlinear sys- 
tems, so we would need to choose eo to make the observed quantity 
eo + Ae agree with experiment, even if Ae had been finite; and 

• Once this is done, along with the analogous things for the electron 
mass and charge, all other quantum electrodynamic corrections are 
quite small and in wonderful agreement with experiment. 

To see this last point let us return to vacuum polarization and do a better 
job of accounting for the spatial variation while still (incorrectly) ignoring 
temporal variation. The fundamental field equation is Gauss's law, 

eoV ■ E{t, f) = I ^*{t, x)^iit, x) . (93) 

If the electric field is not constant in space we need to compute the vacuum 
polarization of each wave vector p. Taking the spatial Fourier transform of 
the source and using Parseval's theorem gives, 

Jd^xe-'P^-^^^^:{t,x)^,{t,x) = I J-0-^^*{t,k)^{t,p-k) . (94) 

From the free field mode sum (!35|) it is apparent that there are two energies 
involved, not the single one of expression (IHOj) . 



E{k) = \J [mc^Y + (/ic)2||A;P and E{p—k) = \J {mc^Y + (/ic)^||p— fcp . 

(95) 

In place of expression ( ISTI) we should expect the two operators to contribute 
as follows, 

^!*{t,k) ^ e^^?-^^]yih\k,s) , (96) 

^{t,p-k) — > exp[^^^^^^] X c\p-k,-s) . (97) 

Now recall that the induced displacement we computed in fl85l) involved 
the inverse third power of "the" energy. As we saw, there are really two en- 
ergies and it turns out that it is their sum which appears in the full quantum 
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field theoretic result. There is also a factor of |, so we can write the induced 



polarization as, 

P{p} 



■ 2f,2 2 

-e h c 



(27r)3 [E{k) + E{p-k)f 



X E{p) 



(98) 



We can write the position space result in terms of a momentum dependent 
shift in the permittivity, 



P(x) 



d^p 
(2^ 



e^P-^Ae{p) Jd^x' e''^'^' E{^) 



(99) 



where the momentum dependent permittivity shift is, 

, , ^ 8e2 /• d^k h^c^ 
Ae{p) 



3hcJ {2tt)^ [E{k) + E{p-k)f 



(100) 



One way of understanding fllOOp is by expanding the energy denominator 
around p = 0, 



E{k) + E{p-k) = 2E{k) 



1 J- + 



K'c^k-p)' 
2E\k) ' 4E\k) AE\k) 



+ ... 



(101) 



Substituting just the first term of (110 II) into (llOOl) gives (|x) the logarith- 
mically divergent expression fl88l) we called Ae. The contributions from all 
the higher terms of the expansion (1101 1) are finite. Using some mathematical 
methods that were developed for quantum field theory they can be evaluated 
to give. 



Aefin(p) 



rfrrfl- 



-r 



In 



2^2 . 



l+r(l-r 



h p 



We can therefore think of the total permittivity as, 

e(p) = eo + Ae(0) + Aefin(p) = e^eas + Aefin(p) 



(102) 



(103) 



where emcas — 8-85 x 10^ F/m is the measured value of the electric permit- 
tivity. 

Expressions (1102111031) illustrate why particle theorists are so pleased with 
renormalization. First, all the divergences are gone, as promised. Second, the 
quantum corrections from Aefin(p) are small, both because of the initial factor 



24 



of and also because the ratio fi^p^ /m^c^ is minuscule for most applications. 
If we express the wave number in terms of a wave length, p = 2ti/\ then the 
ratio is, 



Even at the Bohr radius of about A ~ 5 x 10~^^ m the ratio (11041) would be 
only about 10^'^. Finally, and most important of all, the small effects from 
Aefin(p) have been verified in many experiments. 

One of the strangest effects from Aefin(p) is that the electrodynamic force 
grows stronger at short distances. During the 1990's electrons and positrons 
were brought to within about A ~ 10^^^ m at colliders such as the SLC 
at Stanford University and LEP at the European Nuclear Research Center 
(CERN). At these separations the fractional change in permittivity is. 



Because the electric force goes like 1/e, this 2% reduction in the permittivity 
implies a 2% increase in the force, and that is just what was seen. The phe- 
nomenon is known as running of the coupling constants. We can understand 
it very simply from the fact that increasing the wave number p increases the 
energy of the virtual electron-positron pair, which decreases the time they 
can exist and hence the degree to which they can polarize the vacuum. So 
the charge screening at high p must be less than at low p. 

Because the force fields of the strong interaction attract one another, it 
turns out that the strong interaction becomes weaker at large p. Discovering 
this in 1973 was what won the 2004 Nobel Prize for Politzer [T4j, Wilczek 
and Gross [I5]. Our belief that we know the correct theory of the strong 
interaction is mostly based upon pushing to very small separations, at which 
point perturbative predictions from this theory become reliable. The curious 
fact that strong interactions are much stronger than electromagnetism at low 
energies, and grow weaker at high energies, whereas electromagnetism gets 
stronger, is one thing which makes particle physicists suspect both interac- 
tions are part of a Grand Unified Theory whose unity becomes manifest at 
very high energy. 

We are finally ready to consider how quantum matter affects the Einstein 
equation ([9]). For definiteness let us focus on the 0th order electromagnetic 




(104) 




(105) 
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contribution to the energy density (HM 
8nG 1 



c4 2 



E{t,x) ■ E{t,x) +c^B{t,x) ■ B{t,x) . (106) 



At lowest order this is a product of two free field mode sums, just like the 
charge density of quantum electrodynamics, 

|^*(t,f)^(t,f) . (107) 

As might be expected, the terms they induce on the right hand side of their 
respective field equations are very similar. For quantum electrodynamics in 
the static limit we got, 

^ ^^^^-i(2vr)3^ h'^-f[E{k) + E{p-k)f ^^'^- 
For quantum gravity the analogous result takes the form, 

where h^y{t^x) is the deviation of the metric from its quiescent value and I 
have suppressed its indices in (11091) because more than one component can 
contribute to any one of the ten Einstein equations. The symbol 8^ stands 
for any combination of four quantities built from k and p—k and having the 
dimensions of energy'^. Possible values for include, 

{hc\\p\\f , [h^^k-{p-k)f , E\k)E\p-k) and [E{k)+E{p-k)]\ (110) 

If (I109P represents a contribution from electrodynamics then the energy of a 
photon with wave vector k is E{k) = hc\\k\\. 

There are two outstanding differences between the quantum electrody- 
namic result (llOSp and its quantum gravitational analogue (11091) : 

• The quantum gravitational integrand (11091) contains four factors of en- 
ergy in the numerator as opposed to only two in the quantum electro- 
dynamic numerator ( 1108^ : and 

• All four of the energy factors S"^ in the quantum gravitational integrand 
(11091) can involve k whereas the two factors of ^c||p|| in the quantum 
electro dynamic integrand (11081) do not. 
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The first difference derives from the fact that stress-energy is the source 
for gravity in general relativity, whereas the source for electromagnetism is 
charge. The second difference derives from the fact that the 0-point energy 
of a bosonic mode such as a photon is always positive (the 0-point energy 
of a fermionic meode such as an electron is always negative) whereas the 
same wave vector k contributes both positive and negative charges. That is 
why one gets no coherent effect from the undisturbed charges of a neutral 
medium; the lowest coherent effect in quantum electrodynamics comes from 
the difference of positive and negative charges subjected to an electric field, 
whereas quantum general relativity receives a coherent effect from the 0-point 
energy of each mode. 

The two differences I have noted explain why the divergences of quantum 
general relativity are so much worse than those of quantum electrodynamics. 
To be quantitative, let us evaluate the k mode sum of (11091) with the second 
possibility from the list (IllOp for the numerator factors After some 
tedious but standard reductions we reach the form, 

8nG f d^k [h^^k-{p-k)f 



c4 J (27r)3 [hc\\k\\+hc\\p-k\\Y 



dxx(l-x) dkk- '-^ ^ 3 — V UL^ (III) 

vrc^^o 7o [k^ + x{l-x)p^]2 

Expression (II 111) is a quarticly divergent integral, which is typical of the first 
order corrections in quantum general relativity. To make it well defined I 
will cut off the upper limit aX k = K. This procedure is an example of a 
technique known as regularization which is employed to make mathematical 
sense of the divergences of quantum field theory before they are removed by 
renormalization. I should really have done it at the beginning of the compu- 
tation, with a better technique, and used it consistently throughout (which 
is, rest assured, the standard practice) but this level of rigor is superfluous if 
one just wants an explanation of the problem without precise numbers. 

Once regulated, expression (lllip becomes well defined. We can evaluate 
it exactly but it suffices to give the expansion for large K, 

^Gh f\ , r^^^ ^^[k* + a-fx+fx^)kY+x\l-x)Y] 
' dxx(l — x) dkk — 



3Gh(K^ K^p^ 61 4, /2fCN 190921 ^ 
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All contributions to (11091) have this same form. It is useful to break them 
up into a part that just has positive powers of but diverges when K goes 
to infinity and another part which can have logarithms of p or even inverse 
powers, but remains finite. For the logarithm term this breakup requires the 
introduction of a fixed length scale L, 



In 



2K. 



\n{2LK) - ln{Lp) = - ln{L'^K'^) - - In(LV) + ln(2) . (113) 

We can therefore express the first order electromagnetic correction to the 
Einstein equations as the sum of divergent and finite parts having the form, 



-"div / 



X) 



->fin , 



X] 



GTi 
IF 

Gh 
Gh 



d?p 
(2^ 



AK^ + BK'p' + C\n{L'K') p" x h{p) ,(114) 



AK^h{x) - BK^V\{x) + C\n{L'K^)V*h{x) 



d?p 
(2^ 



-G\n{LY)p^ + Dp^\ X h{p) 



;il5) 



;ii6) 



Here A, B, C and D are pure numbers of order one and I am still suppressing 
the indices of the field /i^j,. Had I considered 0-point contributions from a 
massive field, such as the electron, the resulting Q^^ would have had a more 
complicated structure involving the mass, but the divergent part would have 
had the same form as (11151) . 

One might think it is the most highly divergent parts of (IllSp which 
cause problems for quantum general relativity but that is not so. We can see 
this by taking the static, linearized limit of the Einstein equations (jH]), and 
including the effect we have just discussed from quantum 0-point motions, 

SttG 



V^h + Ah 



T, 



;ii7) 

fiu ' C** ^ ' ^ linear 

(Because Qfj^y already includes the effect of matter 0-point motions one should 
think of the linearized stress-energy tensor as a classical source, just like 
Pfree in our discussion flTHj) of electromagnetic polarization.) Now multiply 
(11171) by c^/SttG and bring the Q^y terms to the left hand side, as we do for 
electromagnetic polarization. 



Chc\n{L^K^ 



V'h + 



+ 



BhcK^ 



+ 



8nG »7r 
c^A AhcK^' 



V'h 



StiG 
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h 



linear 



;ii8) 
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We see that the quadratic and quartic divergences can be absorbed into 
renormahzations of Newton's constant and the cosmological constant, 



Just as neither eo nor Ae(0) is separately observable in electrodynamics, so 
it is only the combinations flll9H120]) which are observable in gravity. 

Alas, there is no parameter in general relativity with which to absorb 
the logarithmic divergence! If only there were then the remaining, finite 
quantum gravitational effects from would be unobservably small. For 
example, the fractional change in Earth's surface gravity due to quantum 
gravitational effects would be about, 

i| ^ 10-- X . ,121, 

which is negligible even if L is chosen to be the Planck length of about 
j^g-35 ^ However, infinity is not small, and that is what one gets from 
the logarithmic divergence. One can only absorb it if new, 4th derivative 
terms are added to the gravitational field equations, but I will show in the 
next subsection that doing so would make the universe blow up. That is 
the fundamental obstacle to making sense of perturbative quantum general 
relativity. 

Before concluding the discussion of renormalization I need to comment on 
three more issues. The first is that Einstein's equations are not linear in the 
field hfj^iy and I have only considered quantum corrections which are linear in 
this field. Might there be additional divergences on nonlinear terms? There 
are indeed such divergences but the conservation of stress-energy prescribes 
how the various powers of h^^ can enter the field equations. With either zero 
derivatives or two derivatives of the full metric the results are unique, 

a° =^ Ag,,, (122) 



,2 



A{g)d'g + B{g)dgdg . (123) 



These correspond to the two terms in the Einstein equations ([9]), and knowing 
their linear parts in hf^^, dictates all higher (and lower) powers as well. 
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The second point is that I have so far worked in the static hmit. That 
is not even correct for classical electrodynamics! Real media cannot polarize 
infinitely rapidly, so dielectric response is frequency dependent. To compute 
the actual polarization one must first Fourier transform the electric field 
in time, as well as space, then multiply by the frequency and wave vector 
dependent permittivity e{uj, k), and only then transform back, 

P(t,x) = /— e-^"*/-^e*^x e(cu,^) x /rft' e*"*7rfV e-*^-""E(t, f ) . 
J 271 J {2tt)'^ J J 

(124) 

This obviously raises issues about causality! Those issues can all be resolved 
but doing so in quantum field theory involves a technique known as the 
"Schwinger-Keldysh formalism" which even many particle theorists don't 
understand^] (The original literature is [161 [IZl [H]; for some nice reviews 
see [H].) They have evolved a series of tricks to avoid having to think about 
it and, although I do understand the technique, I employed such a trick to 
avoid a lengthy (and probably not very illuminating) digression to explain 
it. The trick was to study the static limit of no time dependence and then 
appeal to the fact that the Einstein equation is the unique combination of 
the metric and no more than two derivatives which is consistent with stress- 
energy conservation. So the renormalizations (111911120]) which were found in 
the static limit of only spatial derivatives must apply as well for space and 
time derivatives in the full theory. 

Finally, I should comment that stress-energy conservation allows two lin- 
early independent combinations of four derivatives of the metric. Each of 
these terms has general form, 

A{g)d'g + B{g)dgd'g + C{g)d^gd^g + D{g)dgdgd'g + E{g)dgdgdgdg . 

(125) 

They are called, the "i?^ counterterm" and the "C^ counterterm" after the 
curvature scalars that comprise the Lagrangian densities from which they 
descend. The difference between them has to do with how the indices are 
arranged, which I have suppressed. As with (I122p and (11231) . knowing just 
the linearized parts d^h fixes all other powers. 



^An ignorant (and rather pompous) reviewer for the Department of Energy once rec- 
ommended terminating my funding because the Schwinger-Keldysh field equations I had 
derived must be wrong on account of lacking some contributions with which he was familiar 
from the usual formalism! 
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2.5 The Problem with Higher Derivatives 

We have seen that the renormahzation of perturbative quantum general rel- 
ativity requires that the equations of motion be changed to include terms 
with up to four derivatives. Stelle has shown that if such terms are added to 
gravity (which we cannot any longer call general relativity) then the result- 
ing quantum theory is perturbatively renormalizable [5J. However, it is also 
subject to a virulent instability that is totally inconsistent with the observed 
reality of a universe which is 13.7 billion years old. 

This result is very old, and not specific to gravity; it was obtained in 
1850 by the great Russian physicist Ostrogradsky [20j. Ostrogradsky's the- 
orem is that there is a linear instability in the Hamiltonians associated with 
Lagrangians which depend upon more than one time derivative in such a 
way that the dependence cannot be eliminated by partial integration [20] . 
The result is so general that I can simplify the discussion by presenting it in 
the context of a single, one dimensional point particle whose position as a 
function of time is q{t). 

In the usual case of L = L{q, q), the Euler-Lagrange equation is, 

dL d dL ^ (12Q) 
dq dt dq 

The assumption that ^ depends upon q is known as nondegeneracy. If the 
Lagrangian is nondegenerate we can write (11261) in the form Newton originally 
laid down for the laws of physics, 

q = J^{q,q) =^ qit) = Q{t,qo,qo) ■ (127) 

From this form it is apparent that solutions depend upon two pieces of initial 
value data: go = Q'(O) and go = Q'(O). 

The fact that solutions require two pieces of initial value data means that 
there must be two canonical coordinates, Q and P. They are traditionally 
taken to be, 

dL 

Q = q and P = ^- (128) 

oq 

The assumption of nondegeneracy is that we can invert the phase space 
transformation (11281) to solve for g in terms of Q and P. That is, there exists 
a function v{Q, P) such that, 

^ =P- (129) 
oq 
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For example, one finds f (Q, P) = P/m for tlie liarmonic oscillator (IT^ . 
The canonical Hamiltonian is obtained by Legendre transforming on q, 



H{Q,P) = Pq-L, (130) 
= Pv{Q,P)-L(Q,viQ,P)) . (131) 



It is easy to check that the canonical evolution equations reproduce the 
inverse phase space transformation ( 1129P and the Euler-Lagrange equation 



• OH ^dv dL dv 

• _ dH dv dL dL dv dL 

" '~dQ^~ dQ ^ 'd^ ^ 'd^dQ ^ 'd^ ' ^^^^^ 

This is what we mean by the statement, "the Hamiltonian generates time 
evolution." When the Lagrangian has no explicit time dependence, H is also 
the associated conserved quantity. Hence it is "the" energy by any usual 
standard, of course up to canonical transformation. 

Now consider a system whose Lagrangian L{q, q, q) depends nondegener- 
ately upon q. The Euler-Lagrange equation is, 

^_A^ + il^-0 (134) 
dq dt dq dt"^ dq 

Non-degeneracy implies that ^ depends upon g, in which case we can cast 
(11341) in a form radically different from Newton's, 

= ^(g, g(3)) ^ g(i) = Q(^t, go, go, go, gS'^) • (135) 

Because solutions now depend upon four pieces of initial value data there 
must be four canonical coordinates. Ostrogradsky's choices for these are, 

dL d dL 
dL 

Q2 = q , ^2 = ^- (137) 

The assumption of nondegeneracy is that we can invert the phase space 
transformation (I136II1371) to solve for g in terms of Qi, Q2 and P2- That is. 
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there exists a function a(Qi, Q2, P2) such that, 



dq 



= P2 ■ (138) 

q—a 



Note that one only needs the function a{Qi,Q2, P2) to depend upon three 
canonical coordinates — and not all four — because g, q) only depends 
upon three configuration space coordinates. This simple fact has great con- 
sequence. 

Ostrogradsky's Hamiltonian is obtained by Legendre transforming, just 
as in the first derivative case, but now on g = q^^^ and q = q^'^\ 

H{Q^, Q2, Pi, P2) = E P^(l^'^ - L , (139) 

i=l 

= P1Q2 + P2aiQu Q2, P2) - l{Qu Q2, a{Q,, Q2, P2)) ■ (140) 
The time evolution equations are just those suggested by the notation, 

■ OH ^ ■ OH , , 

Q.._ P..-—. (141) 

Let's check that they generate time evolution. The evolution equation for 

Qi = ^ = Q2 , (142) 

reproduces the phase space transformation g = (^2 in (11371) . The evolution 
equation for Q2, 

■ dH „ da dL da 

reproduces fll38p . The evolution equation for P2, 

• dH da dL dL da ^ dL . ^ ^. 

reproduces the phase space transformation Pi = f| ~ (11361) . And the 
evolution equation for Pi, 

• dH da dL dL da dL 

^^~dQ~i^~ ^dQ'i^'d^^'d^dQ'i^'d^ ' ^ ' 
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reproduces the Euler-Lagrange equation fll34p . So Ostrogradsky's system 
really does generate time evolution. When the Lagrangian contains no ex- 
plicit dependence upon time it is also the conserved Noether current. By 
any standard definition, it is therefore "the" energy, again up to canonical 
transformation. 

There is one, overwhelmingly bad thing about Ostrogradsky's Hamilto- 
nian fll40p : it is linear in the canonical momentum Pi. Note the power 
and generality of the result. It applies to every Lagrangian L{q, q, q) which 
depends nondegenerately upon g, independent of the details. The only as- 
sumption is nondegeneracy, and that simply means one cannot eliminate q 
by partial integration. This is why Newton was right to assume the laws of 
physics take the form fll27p when expressed in terms of fundamental dynam- 
ical variables. 

The Ostrogradskian instability is not a problem with the potential energy, 
in which the dynamical variable can reach arbitrarily negative energies by 
approaching a special value. It is instead an instability of the kinetic energy 
in which arbitrarily negative energies are associated with a special sort of time 
dependence. The problematic term in Ostrogradsky's Hamiltonian fll40p is 
PiQ2- One makes its large by adjusting the third time derivatives in Pi, 
which can be done while the dynamical variable q{t) is still quite small. 

At this point I need to debunk the misconception that unstable systems 
decay "because the system wants to lower its energy." Total energy is con- 
served in fundamental theory, so the decay of an excited atomic state into 
the ground state atom plus some photons leads to no change of energy. What 
drives the decay is entropy: quantum systems explore the space of classical 
configurations which have the same energy, and there are many more ways 
for an atom to decay as opposed to only one for it not to decay. For example, 
the decay photons can go off in any direction. 

This insight about what drives decays means that the Ostrogradskian 
instability is instantly fatal for an interacting field theory. Recall that each 
Fourier mode of such a theory contributes its own, independent kinetic en- 
ergy, and there are an infinite number of such Fourier modes. One's usual 
intuition about modes with large \\k\\ is completely wrong for theories which 
possess the Ostrogradskian instability. We are used to thinking that these 
modes cannot be excited because doing so costs a large energy and there is 
only a finite free energy available. But that is only true when all modes carry 
positive energy. Exciting a negative energy mode frees up energy., which can 
then be used to excite positive energy modes. And exciting a negative energy 
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mode with higher frees up even more energy. Now use expression fl54|] 
to count up the number of modes per unit volume which have ||fc|| < K, 




Of course there is no hmit on K because the higher modes actually participate 
more strongly when the Ostrogradskian instability is present. So one can see 
that an interacting field theory with the Ostrogradskian instability decays 
instantly, no matter how weak the interaction is. 

A final point is that the problematic term P1Q2 of the Ostrogradskian 
Hamiltonian (11401) can have either sign. We have been concerned with the 
fact that it can be arbitrarily negative, but it can also be arbitrarily positive. 
This means that two things happen when you add a higher derivative term 
to a lower derivative theory: 

• There can be changes in the original, lower derivative degrees freedom; 



• The higher derivative term introduces new degrees of freedom which 
carry the opposite kinetic energy to the changed, lower derivative de- 
grees of freedom. 

The usual case is that the lower derivative theory has positive energy, so 
adding a higher derivative induces negative energy degrees of freedom. That 
is what happens with the counterterm; it adds a negative energy, spin 
two graviton which would make the universe decay instantly. However, it 
turns out that the B? counterterm adds a positive energy, spin zero particle 
which is harmless. This represents no violation of Ostrogradsky's theorem 
because the spin zero part of the metric in general relativity carries negative 
energy. It is better known as the Newtonian potential and its negative energy 
poses no problem for stability because this part of the metric is completely 
determined by the stress-energy tensor. The new spin zero degree of freedom 
induced by the E? counterterm is an independent, purely gravitational degree 
of freedom, just like the gravitons. 

To summarize, although adding the counterterm to general relativity 
would be no problem, adding the counterterm would make the universe 
blow up. If only we didn't need the counterterm! But careful analyses 
show that we do need it for scalar particles like the Higgs 0, for electromag- 
netism [3] and for the particles which carry the weak and strong interactions 





(146) 



and 
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[1]. We do not need it for pure gravity at first order in perturbation the- 
ory [2], which is why I have emphasized that the basic problem of quantum 
gravity concerns quantum matter effects which must be present, whether or 
not there is gravitational radiation or it is quantized. That is not to say 
gravitons pose no problems. A heroic second order computation by Goroff 
and Sagnotti fB], verified by van de Ven [7], demonstrates that they induce 
a higher derivative counterterm as unacceptable as the counterterm. 

2.6 The Impact of Primordial Inflation on Two Fixes 

Inflation is defined as a period of accelerated cosmological expansion. We 
know this can happen because it's taking place right now [HI E]. Guth has 
proposed that a very early phase of primordial inflation would explain why 
the current universe is so nearly homogeneous and isotropic on the largest 
scales, why it is so nearly spatially fiat, and why it contains no exotic relics 
such as magnetic monopoles (which typically occur when all the forces are 
unified) and primordial black holes [21] . There is a lot of evidence in favor of 
this idea and none against it, although physicists are not yet ready to regard 
it as proven. I shall have a lot more to say about primordial inflation in 
section 5 but let me for now explore the consequences for quantum gravity 
of three tenets of primordial inflation: 

1. Quantum gravitational fluctuations in the metric at the beginning of 
inflation were no larger than about one part in 10^; 

2. The universe has expanded by a factor of at least 10^^; and 

3. The structures of today's universe derived from 13.7 billion years of 
gravitational collapse into the tiny (one part in about 10^) inhomo- 
geneities provided by quantum fluctuations of the stress-energy tensor 
near the end of inflation. 

Assumptions 1-3 can be used to rule out two proposals which are some- 
times advanced for resolving the apparent inconsistency of perturbative quan- 
tum general relativity: 

• Regard all components of the metric as classical and change the source 
of gravity from the quantum stress-energy tensor to its expectation 
value in some state; or 
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• Regard spacetime as discrete at some very small length scale. 

Of course assumption #3 immediately falsifies the first proposal. If inflation 
is correct then the expectation value of the stress-energy tensor at the end 
of inflation cannot retain inhomogeneities of more than about one part in 
10''^, otherwise they would have been so big at the beginning of inflation 
that gravitational collapse would have ensued. But the measured strength 
of primordial perturbations in the cosmic microwave background and in the 
matter density is about one part in 10^ [23! • ■'■ remark in passing that, 
if inflation is correct, primordial perturbations are the first ever data from 
quantum gravity. One can see how having this data has shifted debates over 
quantum gravity from philosophy and aesthetics to the interpretation of hard 
evidence. Quantum gravity is coming of age. 

One has to work a little harder to debunk discretization. We only have di- 
rect evidence for the continuum nature of spacetime down to about 10~^^ m. 
If space and time were discrete at some smaller scale AL the strength of 
quantum gravitational corrections would be roughly what one gets from tak- 
ing the spatial momentum cutoff K to be 1/AL. That would remove the 
divergences but one also has to keep quantum gravitational corrections suf- 
ficiently small and, it turns out that discretization cannot accomplish this if 
one accepts primordial inflation. 

First note that the scale of discreteness AL cannot be much smaller than 
about the Planck length of [hG/c^]^ ~ 1.6 x 10~^^ m. This might seem 
surprising in view of the first order results flllSp . 



^div ^ ^] AK^h{x) - BK^V^h{x) +C\n{L^K^)V^h{x) . (147) 



Just because K ~ 1/AL will stay finite does not preclude renormalization, 
so we can still absorb the K'^ contribution into a shift of the cosmological 
constant, and the contribution into a change of the Newton constant. 
That leaves only the ln(L^i^^) correction, which would be minuscule at low 
scales, even if the argument of the logarithm is enormous. 

However, one must consider higher order corrections. If one doesn't add 
4th derivative terms to the classical field equations then the divergences of 
quantum gravity grow worse as the order of perturbation theory increases. 
For example, at second order the divergences would take the form. 



^div ^ fSLYlAK^ + BK^V^ + CK^V^ + D HL^ K^)\/^ U(f) , (148) 
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where A, B, C and D are numbers of order one. One can absorb the 
divergence in the cosmological constant and the K"^ divergence in the Newton 
constant, but the other two divergences must be regulated by the cutoff 
K ~ 1/AL. The term proportional to V^h{x) could be written as the first 
order term times an extra factor of GK'^/hc^, 

(^) X CK^^'Kx) = ^ X I — X CV^x) I . (149) 

The term in brackets is roughly the same strength as the S/^h term we got 
at first order, and very small under normal circumstances, but the initial 
factor of GK^/hc^ would be huge if scale of discretization drops much below 
the Planck length. The third order correction would contain two factors of 
this huge number, et cetera. The only way to avoid eventually getting an 
unacceptably large quantum correction is to prevent the cutoff scale AL from 
dropping below the Planck Length. 

Now consider Assumption j^l from primordial inflation, that quantum 
gravitational effects were small at the beginning of inflation. This means that, 
in terms of the physical length measured at the beginning of inflation, the scale 
of discretization cannot have been below about 10^'^^ m. But Assumption 
7^2 says that the universe has expanded by a factor of at least 10^^ since 
the beginning of inflation. That means the physical length between discrete 
points (the number of which cannot change with time) ought to be about 
10^^ m today! That is roughly the distance between stars in our part of the 
Milky Way galaxy, and of course utterly inconsistent with current, direct 
checks of continuum spacetime to about 34 orders of magnitude smaller. 
Turning the argument around, for the current physical length of discreteness 
to be less than 10^^^ m, its value at the beginning of primordial inflation must 
have been 10""^^ m, which is 34 orders of magnitude too small to explain why 
quantum gravitational effects are small during inflation. 

Note that the same argument can be invoked for any early event during 
which we have reason to believe quantum gravitational fluctuations were 
small. Some events and the associated cosmological expansion factors are: 

Recombination =^ 10^ , (150) 

Nucleosynthesis =^ 10^ , (151) 

Quark Gluon Plasma ^ 10^^ , (152) 

Electroweak Symmetry Breaking =^ 10^^ . (153) 
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Primordial inflation provides a vastly stronger bound because it came much 
earlier, but the expansion since electroweak symmetry breaking (which may 
well have seen the formation of the asymmetry between matter and anti- 
matter) lacks only two orders of magnitude to connect the Planck length to 
the current experimental bound on discreteness. We therefore conclude that, 
while spacetime may well be discrete at some, very small scale, this cannot 
explain what is suppressing quantum gravitational effects. 

3 General Reactions to the Problem 

We have seen that the problem of quantum gravity arises from a conflict 
between four physics principles: 

• Continuum Field Theories possess an infinite number of modes; 

• Quantum Mechanics requires each mode to have a minimum amount 
of energy; 

• General Relativity stipulates that stress-energy is the source of gravi- 
tation; and 

• Perturbation Theory simply adds up the contribution from each mode 
at lowest order. 

When a problem can be shown to derive from a well-defined set of propo- 
sitions then one or more of these propositions must be wrong. In the previous 
section I argued that it cannot be the first two. Although spacetime may 
well be discrete at some level, the expansion of the universe implies that this 
discreteness must be at too small a scale to be useful for making sense of 
quantum gravity. And the lowest order divergences of quantum gravity de- 
rive from the quantum properties of matter, which have been too thoroughly 
checked to abandon. It follows that the problem must he either with general 
relativity or with the use of perturbation theory. The great fault line which 
divides fundamental theorists is which of these two is held suspect. 

3.1 Particle Theorists versus Relativists 

The two major schools of thought on quantum gravity consist of those who 
approach the subject from the perspective of particle theory and those who 
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approach it from the perspective of classical general relativity^ Particle the- 
orists are much attached to perturbation theory, so they are willing to alter 
general relativity. This is what led to supergravity and superstring theory, 
and to study of on-shell finiteness and asymptotic safety. Relativists are 
equally attached to general relativity, so they are willing to ignore perturba- 
tive problems. This is what has led to loop quantum gravity. 

Both views reflect the body of experience of those who hold them. The 
history of particle physics has involved aggressively using perturbation theory 
to derive predictions for proposed models of fundamental interactions, and 
then ruthlessly discarding any model which failed to agree with observation 
and experiment. Among particle theorists the "crackpots" were those who 
became too attached to a particular model and either failed to check it using 
perturbation theory or else refused to abandon the model when perturbative 
checks indicated a problem. 

It should also be mentioned that particle theorists are much attracted to 
the idea of unifying gravity with the other forces. This led to progress, first 
with Maxwell's unified theory of electricity and magnetism and, a century 
later, with the unification of the weak and electromagnetic forces for which 
Weinberg, Salam and Glashow shared the 1979 Nobel Prize. And the fact 
that the electroweak and strong coupling constants become equal at about 
10^^ GeV, and that this energy is close to the Planck energy, seems (to 
particle theorists) to point to a fully unified theory at very high energies. 

Relativists come to quantum gravity with a completely different historical 
perspective. For them general relativity is a model which has stood the test 
of time. Whenever people thought there was a problem which necessitated 
changing general relativity it turned out, upon closer examination of either 
the theory or the data, that general relativity was right and the proposed 
changes were wrong. Among relativists the people who made mistakes were 
those who tinkered with the model on the basis of incomplete data or any- 
thing less than a rigorous theoretical analysis. The first example was none 
other than Albert Einstein, who in 1917 introduced the cosmological constant 
A into his gravitational field equations in order to prevent the universe 
from expanding in the simplest cosmological realization of general relativity. 
Of course Hubble actually quantified this expansion in 1929 [24], and Ein- 

^ There are of course exceptions: string theorists who were trained as relativists and 
loop quantum gravity researchers who were trained in particle theory. I hope they will 
not take offense at the names I have chosen to characterize their disciplines. 
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stein could have predicted it liad he just stuck with his original formulation 
of general relativity. He called this the greatest blunder of his life. 

Relativists are also familiar with a vast collection of "paradoxes" which 
purport to show that either special or general relativity is wrong, and which 
can only be debunked by carefully identifying false assumptions. So it seems 
very natural to a relativist to reject the result of an asymptotic series ex- 
pansion, especially when divergences are present. They distrust the idea 
of considering something as "small" when it is actually divergent, and they 
won't be satisfied that there is anything wrong with quantum general rela- 
tivity until a rigorous proof is supplied which is not based on perturbation 
theory. 

I am myself a particle theorist but I have friends in both camps and it 
is sometimes difficult to make them see any worth in the other side's views. 
Although I share the fondness of my particle colleagues for perturbation 
theory, let me reply to an objection they sometimes raise about doubting the 
validity of perturbative results for quantum general relativity. The objection 
takes the form of an exasperated question: Perturbation theory is supposed 
to be valid when the corrections it generates are small and, whatever is the 
right fundamental theory, quantum gravitational corrections must be small at 
low energies because we have never observed a single one! How then can you 
people refuse to accept a perturbative treatment of quantum general relativity? 

My answer is twofold: 

1. There might be low energy quantum gravitational effects which mas- 
querade as something else; and 

2. Perturbation theory might not be generating the correct asymptotic 
series expansion for quantum general relativity. 

I will have more to say about the first possibility in the next subsection. 
Concerning the second, suppose the actual series expansion consists not of 
just powers of GE'^/hc^ but also logarithms, 

(154) 

Note first that trying to beat this into the form of an expansion in powers of 
G would result in uncontrollable logarithmic divergences, which is what we 
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see in quantum general relativity. Note also that low energy quantum gravi- 
tational corrections would still be unobservably small, just not quite as small 
as without the logarithms. For example, at LHC energies the suppression 
factor would not be the figure of ~ 3.4 x 10"'^^ we got in expression fl69|) . We 
would instead get this number times its logarithm, 

(3.4 X 10"^^) ■ ln(3.4 X 10"^^) ~ -2.4 x 10"^^ . (155) 

The extra factor of about 100 represents an enormous enhancement, but the 
effect is still many orders of magnitude below observability. 

Exotic terms occur in many familiar asymptotic expansions, and diver- 
gences are the typical signature of their appearance. Consider the logarithm 
of the grand canonical partition function for non-interacting, non-relativistic 
bosons of mass m in a three dimensional volume V: 

00 

ln(5) = Vuq exp{kpfj.) . (156) 

fe=i 

Here uq = {mkBT /2TilT?)i is the quantum concentration, < is the chem- 
ical potential, and (3 = {kBT)~^. Near Bose-Einstein condensation one has 
< — <^ 1 so it should make sense to expand ln(S) for small Straight- 
forward perturbation theory corresponds to the following expansion: 

^Vn,tk-it^-!^~ Vn, t^Iftk'-i. (157) 

k=l t=0 1=0 k=l 

Although the £ = and C. = 1 terms are finite, the sum over k diverges for 
(i>2. 

The divergences we have encountered do not mean that higher corrections 
are large, just that they are not as small as (/3/i)^. One sees this by expanding 
the second derivative around its integral approximation: 



1 



92 ln(s) 



Vuq 9(/?/i)2 



exp{kf3ij,) , (158) 

k=l 

dyy~^ exp(?//3/i) + exp(A;/3/i) - / dyy"^ exp(?//3/i) ,(159) 

00 

+ ^[A;-^ -2/^5 +2(A;-1)5 ] +0(/3/i). (160) 



k=l 
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Integration reveals the true asymptotic expansion: 

ln(s) = VriQl^dl) + C(^) Pf^ + IV^i-m'^ + 0(/3V) } • (161) 

The oscillating series of ever- increasing divergences in the perturbative ex- 
pansion fll57p has resolved itself into a perfectly finite, fractional power. If 
only we had the analytical power to check for such behavior in quantum 
general relativity! 

3.2 How We'd Use Quantum Gravity if We Had It 

The unsatisfactory state of affairs in quantum general relativity is that the 
only computational tool we currently possess offers the choice between two 
hopelessly incorrect predictions: 

• Either quantum gravitational effects are infinitely strong; 

• Or else the universe blows up instantly. 

But there is obviously some quantum theory of gravity, because quantum 
matter gravitates, and one can discuss what it would tell us if we found it. To 
be specific, suppose we discovered a consistent and plausible quantum theory 
of gravity whose dimensionless coupling strength is that of quantum general 
relativity, GE"^ /he' ~ (E/IO^^ GeV)^. An irony of this subject is that, as 
soon as we manage to avoid predicting infinitely strong effects, we are almost 
inevitably left with no observable predictions at all because GE'^/hc' is so 
small for any process which can be contrived in the laboratory! 

One might think observable effects could be obtained by resorting to 
large masses, such as that of the Earth. This indeed gives measurable grav- 
itational effects from the incoherent sum over many sources, but quantum 
gravitational effects derive from the energy of only a single mode. So the 
energy E appropriate for quantum gravitational corrections to the Earth's 
potential is not the Earth's enormous rest mass energy of M^c^ but rather 
the minuscule energy E = hc/R of the mode whose wave length is the radius 
R at which the potential is measured. Quantum gravitational corrections to 
the Earth's classical potential of —GMe/R take the form [25], 

GMe r Gh ^ , , 

$ = --^{l + const X -- + ...}. (162) 
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At the surface of the Earth the fractional change would be about, 



on 



,-84 



(163) 




~ 10 



Even if we could measure gravity that accurately (and we cannot), this change 
is vastly smaller than the classical effect from the mass of the human taking 
the data! 

How to observe a very weak interaction is not an unprecedented problem 
in the long history of physics. There are two general approaches: 

• Find a regime in which the interaction is not so weak; or 

• Exploit some unique property of the interaction that gives rise to effects 
for which there is no background from other interactions. 

I will comment on both approaches. 

The obvious way of overcoming suppression by the factor GE'^/hc' is 
to scale up the energy E. We cannot build accelerators which approach 
interesting energy ranges but nature does this for us in four cases: 

• The initial singularity which must occur, on very general grounds, ei- 
ther without primordial inflation [26] or with it [27] : 

• The flnal stages of black hole collapse; 

• The flnal stages of black hole evaporation; and 

• The phase of primordial inflation. 

The flrst three processes can access modes of arbitrarily high energy. The 
flnal one might reach as high as ~ 10^^ GeV, at which quantum gravita- 
tional effects would be small but not negligible. So there are good reasons 
to expect signiflcant quantum gravitational effects in all four cases; the is- 
sue is flnding some signature of them that reaches us here and now. Ideas 
about how to do this for the initial singularity and for black hole collapse 
are still quite speculative, and we have not discovered any black holes near 
the end of their existence. However, there is by now a well-developed formal- 
ism for tracing quantum gravitational effects from primordial inflation to the 
current epoch. The simplest interpretation of the data from anisotropies in 
the cosmic microwave background [22] and from large scale structure surveys 
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[231 is thdX the primordial perturbations in the gravitational potential of our 
universe arose from quantum matter fluctuations near the end of inflation. I 
will review the argumentation in section 5. 

So much for the first approach; the other technique is to identify processes 
driven by some special feature of gravity that no other force possesses. So if 
we see the effect at all, no matter how weak it is, it must be from gravity. 
For example, particle physicists did not discern the weak nuclear force in the 
background of vastly stronger quantum electrodynamic processes but rather 
because it alone mediates decays such as /i~ e'v^Ve- 

There are four special features of gravity which deserve comment: 

• One of the gravitational parameters is the cosmological constant A; 

• It determines the maximum speed at which signals can propagate; 

• Gravitons have zero mass without being driven to zero amplitude by 
the expansion of the universe; and 

• The gravitational interaction energy is negative. 

The third point is what my own current research concerns and it will make 
more sense if I postpone it to section 5. I will discuss the first two points 
briefly and the last one at greater length. 

The cosmological constant A multiplies a term in the Einstein equations 
(ini) without any derivatives. It influences the rate at which the overall ex- 
pansion of the universe is accelerating; positive A tends to make the universe 
accelerate whereas negative A tends to make it decelerate. (The spatially 
homogeneous parts of certain matter fields also play a role.) The current 
universe is accelerating [8], [9], which is consistent with A having a small, 
positive value. 



The measured value (I164p of the cosmological constant is outlandish! 
From equation (11201) one can see that first order quantum gravitational cor- 
rection has the form AA ~ GhK^/ c^, where K is the cutoff wave number. Of 
course one must cancel the divergence when K goes to infinity, but there will 
obviously be a finite remainder which takes the same form with some finite 
K. (As particle theorists used to remark during the period renormalization 
was being worked out, "Just because something is infinite does not mean it 




(164) 
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is zero" [ZB]-) The trouble is that the other scales in physics give values for 
AA which are vastly larger than (11641) . 



3 3 

Planck Scale (k^ = ^) =^ AA = ^ ~ 10^^^ x 10"^^ m"^ , (165) 



Gh^ Gh 



Z Boson Mass fx = ^) =^ AA = ~ 10^^ ^ lO'^^ ^"^ , (166) 

\ n y h 



h 

Electron Mass (k = ^) ^ AA = ~ 10=^^ x lO'^^ ^-2 _ ^-^q^. 

Particle theorists refer to a mismatch of this type as a hierarchy problem, 
and the one associated with the cosmological constant is the worst in all 
of fundamental theory [29] . Of course A is free parameter in the Einstein 
equations ([9]) and it has to take some value, so why not precisely the number 
which gives (11641) ? That could be, but many people suspect we are missing 
a key principle from quantum gravity [30j . 

Because the metric field g^u{t, x) determines physical lengths and times, it 
controls the maximum rate at which signals can propagate. A tiny quantum 
fluctuation in the metric could allow some photons of light from a distant 
galaxy to reach us a little sooner than others. Because we don't know cosmic 
distances very well the potentially observable effects would be a blurring of 
images, fluctuations in luminosity and a broadening of spectral lines [32] . 
Since the earliest days of quantum gravity such effects have been termed 
smearing of the light-cone [33j. Surprisingly, the lowest order contributions 
can be computed using first order perturbation theory and they give finite 
results [31]. These results are still unobservably small, but not by much, and 
they may well be detectable in future laser interferometers p5j. It could be 
that history repeats itself because the first quantum electrodynamic correc- 
tion to the electron magnetic moment is finite, and was derived by Schwinger 
in 1948 [2S] while physicists were still puzzling out how to fully absorb all 
the divergences. 

The final special feature of gravity is its negative interaction energy. That 
raises a fascinating possibility in the context of computing the contribution 
to a particle's measured mass from the interaction with its own force fields. 
Every beginning physics student is taught that the electric potential from a 
collection of charges qi at fixed positions Xi is, 

m = E , 11^: • (168) 

, 47ren kc — a;,- 
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They are also taught to compute the electrostatic interaction energy between 
these charges by summing ^qi^{xi), where is the potential due to all 

the other charges, 

^EM = EE o I? • (169) 

The curious rule about omitting the charge's interaction with its own field 
(which is the strongest contribution!) derives from the fact that setting 
X = Xi in (11681) produces a divergent potential. More advanced students are 
instructed to regulate this divergence and then absorb it into the unobserv- 
able bare mass of the particle in such a way as to make the total self-energy 
of the particle agree with its measured mass. (Cf. chapter 16 of the text by 
Jackson [31j.) 

Arnowitt, Deser and Misner have worked out how this procedure changes, 
on the classical level, when the gravitational interaction is included p!3]. It 
is simplest to model the particle as a stationary spherical shell of radius R 
(that is the regularization) charge e and bare mass mo- In Newtonian gravity 
the shell's energy would be. 

It turns out that all the effects of general relativity are accounted for by 
replacing E^/c^ and mg with the full mass m^. 



2 2 6^ Gwip Rc^ 




(171) 



It should be noted that Arnowitt, Deser and Misner rigorously solved the 
constraint equations of general relativity and electrodynamics, and then used 
the asymptotic metric to compute the ADM mass. They also developed the 
simple model I am presenting |13j . 

The perturbative result is obtained by expanding the square root. 



mpertc^ = rriQC^ + " p + E i 1/34/ \ ■ ■ o r? 

(172) 

and shows an oscillating series of increasingly singular terms. The alternating 
signs derive from the fact that gravity is attractive. The positive divergence 



e2 ^ (2n-3)!! / G V ^ 
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of order / R evokes a negative divergence or order Ge^/R^, which results in 
a positive divergence of order G'^e^/R^, and so on. The reason these terms 
are increasingly singular is that the gravitational response to an effect at one 
order is delayed to a higher order in perturbation theory. 

The correct result is obtained by taking R to zero before expanding in 
the coupling constants and G, 

lim rriRC^ = [ ) ' = ^/^Mpunci.c' ~ 10^' GeV . (173) 

Like the expansion of ln(S) in expression (11611) it is finite but not analytic in 
the coupling constants and G. Unlike the expansion of ln(S), it diverges 
for small G. This is because gravity has regulated the linear self-energy 
divergence which results for a non-gravitating charged particle. 

One can understand the process from the fact that gravity has a built-in 
tendency to oppose divergences. A charge shell does not want to contract in 
pure electromagnetism; the act of compressing it calls forth a huge energy 
density concentrated in the nearby electric field. Gravity, on the other hand, 
tends to make things collapse, especially large concentrations of energy den- 
sity. The dynamical signature of this tendency is the large negative energy 
density concentrated in the Newtonian gravitational potential. In the limit 
of i? — *^ the two effects balance and a finite total mass results. 

Expressed this way, there seems to be no reason why gravitational inter- 
actions should not cancel divergences in quantum field theory the same way 
they do in classical field theory. It is significant that the divergences of some 
quantum field theories — such as quantum electrodynamics — are weaker 
than the linear ones which ADM have shown that classical gravity controls 
[37j . So less cancellation is necessary and one might expect a smaller final 
mass, closer to the values of known charged particles. The frustrating thing 
is that the only computational tool we possess for quantum field theory is 
perturbation theory, and one cannot hope to see the cancellation perturba- 
tively. In perturbation theory the gravitational response to an effect at any 
order must be delayed to a higher order. This is why the perturbative result 
(11721) consists of an oscillating series of ever higher divergences. What is 
needed is an approximation technique in which the gravitational response is 
able to keep pace with what is going on in other sectors. 

Note that any finite bare mass drops out of the exact result (I173p in the 
limit i? — > 0. This makes for an interesting contrast with the usual program 
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of renormalization. Without gravity one would adjust the bare mass tuq to 
be whatever divergent quantity is necessary to produce the measured mass 



e2 

moc^ = m^easC^ - g^^^^ • (174) 
Of course the same procedure would work with gravity as well, 

moc' = m^^^c' - + . (175) 

The difference with gravity is that we have an alternative: keep rriQ finite and 
let the dynamical cancellation of divergences produce a unique result for the 
measured mass. This would fulfill the old dream of deriving particle masses 
from their self-interactions. It would also mean that fundamental particle 
masses represent disguised quantum gravitational effects which would provide 
sensitive tests of the theory of quantum gravity if only we possessed the 
analytical tools to predict them. 



4 Current Approaches to Quantum Gravity 

Many fine physicists have burned away their lives grappling with the prob- 
lem of quantum gravity. There is not space in this article to discuss all their 
efforts, nor do I possess the expertise for it. I will here review five of the 
most popular approaches that are still being pursued. Three of them derive 
from the particle theory belief that general relativity must be changed; this 
common origin and methodology dictates that they should be presented con- 
secutively. The remaining two approaches derive from the relativist's belief 
that quantum general relativity might be alright if studied nonperturbatively. 
I will cite review articles and books specific to each approach but I would 
also like to recommend the general review article by Carlip ^38j. 



4.1 Superstring Theory 

A central point to understanding string theory is that it cannot be formu- 
lated the way all other fundamental theories are, by giving the dynamical 
variables and the equations they obey. We do not know what the fundamen- 
tal dynamical variables of string theory are, nor the equations they obey. 
What we have instead is a formalism for perturbatively computing what is 
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the usual observable of a quantum field theory, the S-matrix0 The reasons 
for this are historical so I will summarize how string theory was developed. 

String theory began in the late 1960 's as an attempt to understand the 
strong interactions. Experiment had shown a series of resonances whose 
mass-squared increases approximately linearly as a function of angular 
momentum J, starting from a positive intercept, 

= +ml + AmV . (176) 

It was obvious to everyone that the strong interactions could not be treated 
perturbatively (no one then suspected that the strong interaction would be- 
come weaker at high energies) so, instead of proposing quantum field theories, 
physicists tried to guess scattering amplitudes which incorporated such reso- 
nances. The first to succeed was Gabriele Veneziano [IQ], who proposed what 
would become known as the 4-particle, open string tree amplitude. Miguel 
Virasoro found the analogous amplitude for a closed string |41j. These 4- 
particle amplitudes were quickly generalized to give A^-particle scattering for 
open [12] and for closed strings [l3]. 

The early string amplitudes had linearly rising resonances (11761) but they 
suffered from three problems: 

• They did not include fermions; 

• They contained resonances with the wrong sign to come from physical 
particles; and 

• They started from a negative intercept, rather than a positive one. 

Particles with the wrong sign are called "ghosts" and they can be viewed 
as making the theory decay instantly through a kinetic instability like the 
Ostrogradskian instability I discussed in section 2.5. Particles with a negative 
mass-squared are called "tachyons." People sometimes make the mistake 
of thinking of them as particles that move faster than the speed of light 
but what they really are is instabilities of the potential energy. In a field 
theory such an instability would not be serious, it just means the field decays 
to some value with a lower potential energy, as happens when electroweak 

''^Maldacena's AdS/CFT correspondence [35] is now widely accepted as providing a 
nonperturbative formulation of string theory (in terms of an ordinary quantum field the- 
ory) for the boundary conditions associated with the most symmetric solution of general 
relativity with a negative cosmological constant. 
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symmetry breaking takes place in the Standard Model. But string theory 
is not based upon a field theory so there is no principle to tell us how the 
perturbative background shifts — or even what the perturbative background 
is on the fundamental level. All we have is the S-matrix about a handful of 
backgrounds, and the appearance of a tachyon in the spectrum implies that 
this S-matrix cannot be correct. 

It was my University of Florida colleague, Pierre Ramond who worked 
out how to incorporate free fermions in 1971 [44]. By this time it had been 
recognized that string theory scattering amplitudes can be written as inte- 
grals with respect to the coordinates of a string X'^{a), similar to the way 
that ordinary quantum field theory scattering amplitudes can be written as 
integrals with respect to the spacetime coordinates of a point particle x^. In 
neither case are the true dynamical degrees of freedom these coordinates; for 
quantum field theory the true dynamical degrees of freedom are the various 
fields, no one knows what they are for string theory. 

Shortly after Ramond's work, Neveu and Schwarz added a new kind of 
string fermions to produce the amplitudes for interacting bosons [43]. Al- 
though these models had supersymmetry in the string coordinate space, the 
amplitudes themselves were not supersymmetric. In 1976 Gliozzi, Scherk 
and Olive showed how to get supersymmetric amplitudes by combining Ra- 
mond's formalism with that of Neveu and Schwarz, and then projecting out 
a certain sector which includes the problematic tachyon [45]. This was the 
birth of superstring theory. 

Ramond's work is tremendously significant for quantum gravity because it 
represents the first appearance of a supersymmetry which connects fermions 
and bosons. There are many interesting things about supersymmetry but its 
importance for quantum gravity derives from a fact I mentioned in section 
2.4: the 0-point energies of bosons and fermions with the same mass m and 
wave number k = \\k\\ cancel. 

Bosons ^ +^y'm2c4 + {hck^ , (177) 

Fermions =^ — --y/m^c^ + {hcky . (178) 

Supersymmetry involves a tight relation between fermions and bosons, which 
is necessary if this cancellation is to do any good for quantum gravity. With 
this tight relation, every correction from the 0-point motion of bosons in 
the theory tends to be canceled by an opposite correction from the 0-point 
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motion of fermions. 

At this point I must digress to discuss the imphcations supersymmetry 
has for the cosmological constant A. Unbroken supersymmetry is only con- 
sistent with A being zero or negative, not positive. If it exists in nature 
then supersymmetry must be badly broken at low energies because unbro- 
ken supersymmetry predicts that every known particle has a "super-partner" 
(bosonic super-partners for known fermions and fermionic super-partners for 
known bosons) with the same mass, and not a single one of these super- 
partners has been observed. 

One does not need superstring theory in order to have supersymmetry. 
In fact, supersymmetric algebras and/or quantum field theories were devel- 
oped before superstring theory by Golfand and Likhtman [?7j , by Volkov and 
Akulov [H], by Volkov and Soroka [19] and by Wess and Zumino [50]. Su- 
persymmetry had also been extended to gravity to produce a class of models 
known as supergravity by Freedman, van Nieuwenhuizen and Ferrara ^Slj and 
by Deser and Zumino [52]. (I shall have much more to say about supergravity 
in the next subsection.) A fact of great importance for this exposition is that 
the cosmological constant can have any sign in generic theories of gravity but 
it can only be negative or zero in supergravity and superstring theory. So 
the only way supergravity or superstring theory can be consistent with the 
observed acceleration of the current universe [H [9] is to suppose that we are 
now in a metastable state which will eventually decay to a universe which is 
either decelerating or actually contracting. 

Recall that the initial string amplitudes had three problems: no fermions, 
ghosts and a tachyon. We have just seen that superstring theory resolves the 
first problem and the third one. My University of Florida colleague, Charles 
Thorn was among those who proved that the ghosts drop out in certain 
key dimensions: D = 2Q dimensions for bosonic string theory and D = 10 
dimensions for superstrings [531 ISl] • 

The "no-ghost" theorems actually came in 1972, before the invention of 
superstring theory, so string theory still had the tachyon problem. It also had 
massless particles: a spin one particle in the open string amplitudes and a spin 
two particle in the closed string amplitudes. Although these particles pose 
no problem for stability, they are no more part of the observed spectrum of 
strongly interacting particles than tachyons. The funny dimensions dictated 
by the no-ghost theorems also seemed to preclude using string theory to 
describe the strong interactions. And recall from section 2.4 that Politzer 
[2] and Wilzcek and Gross [I^ showed in 1973 that the currently accepted 
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theory of the strong interactions gets weaker at high energies, which meant 
perturbation theory can be used to make predictions. These predictions were 
confirmed, by which point few people were interested in string theory. 

In 1974 Joel Scherk and John Schwarz made a virtue of the massless parti- 
cles by showing that they could be the photon and the graviton, respectively 
[53] . A year later Scherk and Schwarz proposed that the six extra dimen- 
sions of (what would become) superstring theory were "compactified" 
A dimension which is compactified does not extend to macroscopic distances 
the way the three known spatial dimensions do; instead it is rolled up into a 
circle (or more complicated shape) of radius so small that we cannot observe 
motion in this direction. With the advent of superstring theory the next year 
all the ingredients were in place for a potential theory of everything. 

At first very few people were interested. The majority of particle physi- 
cists were working out the consequences of what would become known as 
the Standard Model. More and more people also began studying the Grand 
Unified Theories which were suggested by the fact that the three Standard 
Model coupling constants become comparable at very high energies. Proper, 
mainstream particle theorists did not worry about quantizing gravity! I recall 
one of my graduate professors being asked for a reference on supersymmetry 
and supergravity and replying, with lofty disdain: 

Supersymmetry is one of those subjects which we here at Harvard 
try to discourage students from studying. 

Students who wished to study quantum gravity at Harvard did so through 
a sort of "underground railroad" in which Sidney Coleman signed the forms 
but Stanley Deser was our true adviser. 

Supersymmetry was eventually accepted by particle theorists as a way of 
solving a hierarchy problem somewhat less severe than the one I discussed 
in section 3.2 with regard to the cosmological constant. Supersymmetry also 
allowed particle theorists to continue using perturbation theory, which is our 
chief analytical tool, to very high energies. Quantum gravity got accepted 
shortly thereafter. Part of the reason for this is that the scale then envisaged 
for Grand Unification (10^^ GeV) is only three orders of magnitude below 
the Planck scale (10^^ GeV). Another reason is that the extra components of 
a higher dimensional metric (it has \D[D + 1) components in D spacetime 
dimensions) could be regarded as the other kinds of bosonic particles, thereby 
unifying gravity and the other forces. For example, the 15 components of 
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a 5- dimensional metric could be regarded as the ten components of our 4- 
dimensional metric, plus a 4-component vector potential and a scalar particle. 

Superstring theory was rehabilitated because higher dimensional general 
relativity isn't a very good way of unifying all the forces. For one thing, 
it wasn't clear how to make it incorporate fermions correctly. For another, 
increasing the number of dimensions makes the divergence problem worse. It 
is easy to understand why from the discussion of section 2.4. Recall that the 
first order gravitational response to 0-point motion in a static gravitational 
field of wave number p involves a divergent mode sum of the form, 

/ ^ ^ = AK^ + BKY + Cln(K'')p'' + Finite, (179) 

where consists of a variety of terms quartic in p and k, K is the cutoff and 
A, B and C are numbers of order one. The details of how extra dimensions are 
compactified don't matter in the regime of large wave numbers which gives 
rise to the problem. So going to six spacetime dimensions amounts to chang- 
ing the wave vector in (11791) from k = {ki, k2, k^) to k = (fci, k2, k^, k/i, k^) to 
give, 

/ ±^ ^ ^ = AK6^EirV + CirV + Dln(ir2x e^pi^i^g _ 

J {27if[E{k) + E{p-k)f 

(180) 

This requires even more unacceptable counterterms than the 4-dimensional 
theory! 

Interest in superstring theory surged in 1984 when Green and Schwarz 
showed that the theory naturally incorporates the right kinds of fermions 
and is also likely to be finite [57j. The statement about finiteness might seem 
surprising because the no-ghost theorems require superstring theory to exist 
in D = 10 spacetime dimensions and I have just explained that increasing 
the dimension makes the divergences of quantum general relativity worse, not 
better. Of course the explanation is that superstring theory is not general 
relativity; it doesn't even have the metric as one of its fundamental degrees 
of freedom. Although superstrings do incorporate gravity, stress-energy is 
not the source of gravity at high wave number, so superstrings violate the 
3rd of the four propositions listed at the beginning of section 3. 

All of this raises the question of what superstring theory is on the fun- 
damental level. The unsatisfactory answer is that no one knows! Based on 
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the way we have of expressing the perturbative S-matrix one might think the 
fundamental variable should be a string field, that is, a field whose argument 
is a string's position in spacetime X^{a), just like a normal field could be 
regarded as depending upon a particle's position x^. String field theories 
can indeed be constructed which reproduce the perturbative string S-matrix 
|58j . The earliest ones were in a noncovariant formalism due to Kaku and 
Kikkawa [SH], but the 1980's witnessed the development of lovely invariant 
Lagrangians by Witten [60] and Horowitz, Lykken, Rohm and Strominger 
[61j . The trouble with these string field theories is that they must be nonlo- 
cal, that is, they cannot be expressed in terms of just the dynamical variable 
and a finite number of its derivatives [62] . This is not some defect of how a 
string field theory was constructed from the perturbative S-matrix, it must 
be true in order for superstrings to avoid divergences. Instead of the coupling 
to the energy of a mode, superstring field theories couple to the energy times 
exponentials of the wave number. 

General Relativity =^ E{k) = hc\\k\\ , (181) 
String Field Theory =^ hc\\k\\ x e""'"^"' , (182) 

where a' is the Regge slope parameter. Of course these exponentials make 
the integrals converge at high \\k\\, but one has to wonder if they engender 
new problems. 

Anyone who has studied quantum mechanics knows that exponentiating 
the derivative operator effects a spatial translation, 

'^^ikx ^ e'fcAx^^^^ ^ /(x + Ax) . (183) 

Exponentiating the square of a derivative, as in (11821) . involves a superposi- 
tion over translations of all distances out to infinity. And one must of course 
include temporal as well as spatial derivatives or else there would be a mas- 
sive violation of Lorentz invariance. Recall from section 2.5 that adding even 
a single higher time derivative results in new degrees of freedom which have 
the opposite kinetic energy to the lower derivative degrees of freedom. It 
turns out that the problem grows worse the more derivatives you add: there 
is an extra degree of freedom for each new time derivative and essentially 
half of these new degrees of freedom carry negative kinetic energy. That 
is why we cannot modify the gravitational equations of motion to include 
the counterterm, or the higher derivative counterterm whose necessity for 
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pure gravity proven by Goroff and Sagnotti [SI El' So is it alright to have 
exponentials of the derivative operator in string field theory? 

Not every nonlocal field theory succumbs to the Ostrogradskian insta- 
bility [63]. But the nonlocality of string field theory must be restricted to 
entire functions of the derivative operator or else the conservation of proba- 
bility would not work out correctly. And the definition of an entire function 
is that it converges to its Taylor series expansion. This means that string 
field theory can be viewed as the limit of a sequence of ever-higher derivative 
models, which grow more and more unstable. One sometimes hears the state- 
ment that the extra degrees of freedom might decouple in the limit because 
they get driven to infinite frequency, but recall the fallacy of this argument 
from section 2.5: it plays on one's intuition, from lower derivative theories, 
that high frequency modes cannot be excited because there is only a limited 
amount of free energy. That is only true when all degrees of freedom have 
positive energy; when there are negative energy degrees of freedom, the high 
frequency modes can be excited by also exciting modes with the opposite 
energy. Far from dropping out, high frequency modes of negative energy 
dominate because there are so many more of themjf] 

All of this led David Eliezer and me to conclude in 1989 that string field 
theory must suffer from the Ostrogradskian instability [65]. It doesn't show 
up in perturbation theory simply because the nonlocality is restricted to 
interactions, but it is present in the full theory. Of course the only reason for 
studying string field theory is to get a nonperturbative definition of string 
theory, so our result means either that string theory is wrong or else that 
some other formalism defines string theory on the nonperturbative level. 
Most string theorists take the latter view, but there has so far been no other 
nonperturbative way of defining the formalism in generally 

It has been notoriously difficult to derive testable predictions from su- 
perstring theory. A notable exception to this is the 1988 observation by 
Antoniadis, Bachas, Lewellen and Tomaras that, if string supersymmetry, 
which must be broken at the low energy scales we can access in experiments, 

""In view of the recurring efforts to legitimize entire functions of the derivative operator I 
should observe that, were they allowed, we could solve the divergence problems of quantum 
gravity without superstrings [64j . 

^Note again that Maldacena's AdS/CFT correspondence [39] is now widely accepted as 
providing a nonperturbative formulation of string theory (in terms of an ordinary quantum 
field theory) for the boundary conditions associated with the most symmetric solution of 
general relativity with a negative cosmological constant. 
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should happen to be broken perturbatively, then there has to be at least one 
"large" extra dimension within experimental reach [OSI- This was elaborated 
further |67j and has led to a huge amount of work which I won't attempt to 
review, but I do wish to relate a funny story about it. The story concerns 
a "phenomenologist" — the kind of particle theorist who works closely with 
experimentalists to explain data — who was frustrated by the sometimes in- 
comprehensible mathematics of string theory. After a seminar by Antoniadis 
she exclaimed: 

/ can't usually understand much of what the string theorists tell 
me. But this, this looks like a prediction! 

That concludes the portion of string history which has the greatest rele- 
vance to this article. Of course there have been many more developments in 
superstring theory. Some of the principal ones are: 

• In 1995 Witten suggested that relations between the various 10-dimen- 
sional superstring theories point to an 11-dimensional parent that was 
dubbed M-Theory [68]. 

• Also in 1995, Polchinski showed that extended objects called D-branes, 
on which open strings can end, are necessary to realize one of the 
relations between superstrings [69] . 

• The first microphysical derivation, in 1996, of the Bekenstein-Hawking 
entropy of a black hole by Strominger and Vafa [70]. This is the most 
successful application of superstring theory, although one should note 
that subsequent derivations were made (for example, from loop quan- 
tum gravity [71]) and Carlip has suggested that the result follows from 
conformal symmetry near the horizon rather than from the details of 
superstring theory [72] . 

• Maldacena's 1997 conjecture that string theory on one spacetime mani- 
fold is equivalent to a certain quantum field theory without gravity on a 
lower-dimensional manifold [39] . This is a terrifically important insight 
because it might serve as the long-sought, nonperturbative definition 
of superstring theory. It has also been applied to nuclear physics and 
even condensed matter physics with interesting results. 
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• The 2000 discovery by Busso and Polchinski that vast numbers of dis- 
crete choices, called flux vacua, can be made in compactifying super- 
string theory f73]. The number of these choices is estimated to run 
from between 10^°° and 10^°° [71], and no principle is yet known to fix 
which one is taken. 

• The 2003 suggestion by Susskind that the Anthropic Principle might 
usefully constrain where we are in the vast string landscape of flux vacua 
|75j . The Anthropic Principle states that physical laws must be such 
that humans of some other form of intelligent life can exist to observe 
them. 

The last two developments on my list, and the fact that string theory 
predicts the wrong sign for the cosmological constant, have led to a slackening 
of interest in superstrings as a fundamental theory of everything, although 
there is still much work on applications. (I will discuss one in the next 
subsection.) In the face of numbers like 10^°°, some physicists now speak of 
a "theory of anything" rather than a theory of everything. And even veterans 
of decades of string research find it difficult to accept the Anthropic Principle. 
A personal anecdote might best convey the current state of affairs. Early in 
the spring 2007 semester my University of Florida colleague, Charles Thorn, 
began a seminar by announcing his belief that: 

String theory is just a technique for summing the leading terms 
in the 1/N expansion of QCD. 

After years of hearing more ambitious assessments this was so shocking that 
I checked to be sure I had understood correctly. Charles confirmed that I 
had; in his current view, the effort to regard superstrings as a fundamental 
theory of everything was a blind alley. Later that year I related Charles' 
pronouncement to string theory colleagues on three continents and solicited 
their own opinions. About half of them agreed with him, more often the 
younger people. 

String theory has occupied some of the best minds of particle theory for 
many decades, and it would require more than the space allotted for this 
article to do justice to their efforts. Nor do I possess the expertise for it. Let 
me instead refer the interested reader to some standard books by the leaders 
of this field. I recommend the texts by Green, Schwarz and Witten [76], by 
Polchinski [77], by Zwiebach ^78j, by Kiritsis [79] and by Becker, Becker and 
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Schwarz [HO]. In fairness I should also mention popular science books which 
are critical of string theory by Smolin ^T] and by Woit [52] • 

4.2 On-Shell Finiteness 

Although superstring theory may be short on successful predictions, it does 
provide a very efficient way of organizing perturbative calculations which in- 
volve corrections from many components of tensor fields such as the metric. 
In fact the simplest method for computing the lowest order scattering am- 
phtude for 4-gravitons in general relativity is to use superstring theory and 
then take a certain limit [S3] • A few more tricks from superstring theory, and 
a lot of hard work by many physicists over the course of two decades, has led 
Bern, Dixon and Roiban to conjecture that a version of supergravity might 
actually be On-Shell Finite to all orders in perturbation theory [83]. I shall 
first explain what this means, what the theory is which may have this prop- 
erty, and recent results concerning it. Then I will review the developments 
that led to this possibility. 

Recall that renormalizable quantum field theories have divergences which 
can be absorbed into redefinitions of free parameters. Once this is done one 
can compute not only scattering amplitudes but also the expectation val- 
ues of products of the field operators at different points, which are known 
as correlation functions. In contrast, an on-shell finite theory has scattering 
amplitudes which are finite, without the need for renormalization, but its cor- 
relation functions are not necessarily finite or even renormalizable|l| On-shell 
finiteness has particular importance for theories of gravity and supergravity 
because they cannot be perturbatively renormalizable, but divergences tend 
to cancel out of their scattering amplitudes. For example, pure gravity, with- 
out any matter, is on-shell finite at first order in perturbation theory [2] . One 
has to go to the second order before divergences occur [6l [7] . 

Recall that a supersymmetry relates fermions to bosons. This is a very 
good thing for divergences because bosons contribute a positive 0-point en- 
ergy whereas fermions contribute a negative 0-point energy, so if one contrives 
a tight relation between the two kinds of particles then it is conceivable that 
quantum corrections from fermions will cancel the divergences in quantum 
corrections from bosons, order-by-order in perturbation theory. Just that 

^However, one can construct redefinitions of the original fields which give finite corre- 
lation functions. 
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seems to be happening in a model called, = 4 Super-Yang-Mills theory. 
I should explain that "Yang-Mills" theories are nonlinear generalizations of 
electrodynamics which can describe forces such as the weak and strong in- 
teractions. They do not have to be supersymmetric, but the term "iV = 4" 
means this particular model has the maximum amount of supersymmetry 
possible for a Yang-Mills theory. Although this supersymmetry almost cer- 
tainly makes the model finite, it comes at a terrible price: = 4 Super- 
Yang-Mills theory cannot incorporate the known particles of the Standard 
Model. Further, the exact cancellation of divergences is only valid as long as 
the particles are all massless, which is very far from the observed universe in 
which only the photon, the "gluon" which carries the strong force, and the 
graviton appear to be massless. 

The gravity model which might be finite is called, 'W = 8 Super gravity." 
Just like A^ = 4 Super- Yang-Mills theory, its cannot be a realistic model of the 
universe because its particles are all massless, and they lack other essential 
properties of the known particles in the Standard Model. The spectrum 
of A^ = 8 Supergravity consists of: a graviton, 8 spin | particles called 
gravitinos, 28 vectors bosons, 56 spin ^ fermions and 70 scalar bosons. The 
term N = 8 means this model has the maximum number of independent 
supersymmetries for a theory of gravity. 

It might seem strange that people are so fascinated with a theory which 
they know cannot describe the universe. The reason is that supergravity 
models with less supersymmetry do have a chance to describe physics. Al- 
though these models are finite at first order in perturbation theory, and even 
at second order, it had been believed that they must suffer uncontrollable 
divergences at third order [85]. Those expectations are based on demonstrat- 
ing that counterterms exist at higher orders which are not forced to vanish 
by any known symmetry or dynamical relation. There is a widespread be- 
lief in particle physics that any divergence which can happen, must happen. 
Weinberg refers to such beliefs as "folk-theorems", and this one has never 
been checked because it was just too difficult to make the required compu- 
tations. The interest in A^ = 8 Supergravity derives from the fact that it 
has recently become possible to check for divergences in this model and they 
aren't present at third order |86j . 

Of course the third order result prompted supergravity experts to re- 
examine the old arguments [85] and they soon explained why the divergence 
fails to occur [87] . They also predicted a new divergence at higher order [87] . 
There was even a famous bet about this between Zvi Bern and Kelly Stelle. 
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The funny thing is, when the prediction was checked, the theory was again 
seen to be finite [SB]. As a good scientist, Stelle has admitted he lost the bet 
and is hard at work trying to understand what happened. 

The computational technology which made this possible began to be de- 
veloped in the late 1980 's by Zvi Bern and David Kosower. They noticed 
how much more efficient string theory is than the existing applications of 
Yang-Mills theory at computing scattering amplitudes. Recall that Yang- 
Mills theories can describe the strong interactions, which are an important 
background for very high energy experiments. People needed an efficient way 
of computing Yang-Mills amplitudes with lots of external particles and Bern 
and Kosower developed one by taking suitable limits in string theory By 
1990-91 they had extracted the essential simplifications of string theory and 
worked out how to do efficient computations directly in Yang-Mills theory 
|90j . In 1992 they were joined by Lance Dixon and the trio derived a number 
of useful results about the strong interactions during the mid 1990's [9T| . 

By the turn of the century Bern, Kosower and Dixon had developed a 
technique they called the "unitarity method" in which perturbative correc- 
tions could be generated from the lowest order results [52]. The idea is a 
more sophisticated version of the old bootstrap program from the 1960 's. If 
we express the S-matrix as S* = / -|- iT, then unitarity implies, 

^"^ X ^ =^ -i{T-T^) =T^ xT . (184) 

So if the T matrix consists of a perturbative series expansion, one can deter- 
mine the A^th order contribution to (T — T^) from sums of products of lower 
order terms on the right hand side of fll84p . Then the A^th order contribu- 
tion to (T + T'^) can be inferred by a a procedure known as "bootstrapping", 
and the stage is set to push one order higher |93l [HI [95]. This procedure 
works best for supersymmetric Yang-Mills theory because then the bootstrap 
is simplest, and the bootstrap works best of all for = 4 Super- Yang-Mills 

m- 

The reader might wonder why an efficient technique for computing Super- 
Yang-Mills amplitudes has any relevance for supergravity. The connection 
is provided by the Kawai-Lewellen-Tye (KLT) relations between open and 
closed string amplitudes [97]. The KLT relations say that a lowest order 
gravitational scattering amplitude can be factorized into sums of products 
of Yang-Mills amplitudes. These relations are relatively simple to see from 
string theory but so difficult to recognize for ordinary field theory that they 
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were only inferred using string theory. Of course superstring theory requires 
ten dimensions, which we don't want, and it also contains infinite towers of 
super- massive particles that are present in neither N = 4 Super- Yang-Mills 
nor = 8 Supergravity. Both problems are avoided by taking the low energy 
limit [98]. So the key to computing high order perturbative corrections to the 
sattering amplitudes of = 8 Supergravity is first to decompose these am- 
plitudes into products of zeroth order amplitudes using the unitarity method, 
then evaluate the later in terms of = 4 Super- Yang- Mills amplitudes [99] . 

The most recent results about N = 8 Supergravity [88] mean something 
is wrong with our thinking about what sorts of divergences can happen. No 
one knows how high the cancellations extend in = 8 Supergravity, or if 
they might apply to more realistic models. We could be witnessing the start 
of a revolution. The latest progress on this subject is so new that there are 
no books or long review articles. I highly recommend the short article by 
Bern, Carrasco and Johansson [100] . 

4.3 Asymptotic Safety 

Recall from section 2.6 that, if one doesn't consider the and countert- 
erms to be part of the gravitational field equations then the divergences of 
quantum gravity get worse at each order in perturbation theory. The same 
thing happens if you include the 4th derivative counterterms but regard them 
as perturbations, rather than as part of the 0th order field equations. In that 
case each new order in perturbation theory requires the introduction of coun- 
terterms which contain two more derivatives of the metric. Accepting this 
escalating series of counterterms might seem crazy, in view of the fact it could 
be avoided by simply considering the and counterterms to be part of 
the 0th order equations. However, the procedure has a saving virtue: if we 
regard the higher derivative counterterms as perturbations then they do not 
add new degrees of freedom to the theory, and the Ostrogradskian instability 
of section 2.5 is avoided. 

The program of Asymptotic Safety is to accept the escalating series of 
perturbative counterterms and attempt to show that interesting predictions 
can be derived from the resulting formalism [101] . To make this approach 
intelligible I will first explain why regarding the counterterms as perturbative 
avoids new degrees of freedom. Then I discuss how the counterterms affect 
the theory's ability to make predictions. Finally, I will review progress on 
the subject. 



62 



The discussion of degrees of freedom is simplest in the context of a one di- 
mensional, point particle whose position as a function of time is q(t). Suppose 
that the Lagrangian is that of a harmonic oscillator with a higher derivative, 
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Here m is the particle's mass, u is a frequency and g is a small, positive num- 
ber I wish to think of as a coupling constant. The Euler-Lagrange equation, 

9 A +q + ^2^ = o, (186) 



has the general initial value solution, 

q{t) = cos(A;+t) + 5+ sin(A;+t) + A_ cos{k_t) + B_ sin{k_t) , (187) 
where the two frequencies and the four combination coefficients are. 



k. 



' kiqo + qo „ klqo + d^qo/dt^ 



2^L^^V- ^^J ' kl-kl ' k±{kl-kl) ■ 

(188) 

The + mode carries positive energy; it is the mode that would appear even for 
(yf = 0. The — mode carries negative energy; it is the new, higher derivative 
degree of freedom. 

Suppose we regard the higher derivative term as a perturbation. That 
amounts to making the substitution, 

oo 
n=0 

in the Euler-Lagrange equation 01861) and then segregating terms with the 
same powers of g. The resulting series of equations is, 

Xo + u'^Xq = 0, (190) 

*^+"'^^ = '182' 

and so on. Note that the higher derivative terms always enter as sources, 
evaluated at the lower order solution. They don't appear at 0th order, so one 
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only needs go and go to get a unique solution Xo(t). The higher derivative term 
does appear in the 1st order equation, but only in the form of d'^Xo(t)/dt'^, 
which was already fixed at 0th order, so one again needs just go and go to 
determine xi{t). Because the equation is linear the resulting series can be 
summed up and gives, 

gpert = goCos(A;+t) + -p- sin(/c+t) . (193) 

Note that the higher derivative term did have an effect, it shifted the fre- 
quency of oscillation from the 0th order result of u to kj^. However, we 
have avoided the higher derivative degrees of freedom which give rise to the 
Ostrogradskian instability. 

Although explicit results are difficult to obtain when the equations be- 
come nonlinear, one can prove that this procedure works generally [651 1102] , 
so it should apply to the higher derivative counterterms of quantum gravity. 
Of course it cannot serve as a nonperturbative definition of quantum grav- 
ity (in this I dispute the official position of Asymptotic Safety) but I have 
already explained in section 2.3 that even the asymptotic series solutions 
one derives from perturbation theory should be wonderfully accurate at low 
energies. The problem is the escalating series of counterterms, each with its 
own completely arbitrary, finite part. Those finite parts can only be fixed 
by requiring some prediction of the theory to agree with measurement. But 
each time this is done one loses a potential prediction, and it must be done 
an infinite number of times! In such a case one wonders what the resulting 
formalism can be used to predict? 

One answer to this question was provided by John F. Donoghue [103] . 
He pointed out that all counterterms must take the form of polynomials 
of the Fourier wave vector p such as we found in expression (11151) . The 
finite contributions (I116P contain some terms of this form — and they will 
be rendered ambiguous by the arbitrary finite parts of the counterterms. 
However, there are also terms in (11161) of the form p^ln(L^p^) which could 
never have been part of a counterterm and are logarithmically enhanced with 
respect to the p'^ counterterms in the small p regime which is most accessible 
to experiment. That is how a unique result was obtained for the first quantum 
gravitational correction (I162p to the Earth's potential [25]. This way of using 
a nonrenormalizable quantum field theory to make predictions is known as 
low energy effective field theory. 
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The program of Asymptotic Safety aims at the even better possibihty that 
we might be able to predict the finite parts of all the counterterms in such a 
way that quantum general relativity could be used at all energies. Explaining 
how this works in any detail would go far beyond the scope of this article but 
I will try to put the idea across in simple terms. Recall from our discussion 
of renormalization in section 2.4 that the strengths of interactions change as 
one varies the energy scale. This running of coupling constants has many 
important consequences for particle physics. For example, forcing the strong 
interactions into a regime for which perturbation theory is reliable provides 
our principal check on the theory, and the fact that the various interactions 
become comparable at a very high energy is the strongest evidence for Grand 
Unification. The new counterterms one finds in quantum general relativity 
vary with the energy scale as well. The hope of Asymptotic Safety is that 
this variation might carry each of them to a unique value called a nontrivial 
ultraviolet fixed point. If this proves to be the case then we would not have 
to use up predictions in order to determine the finite parts of counterterms. 
Instead we would just set them to their values at the ultraviolet fixed point 
and this should be very near to the correct result at sufficiently high energies. 

The possibility of Asymptotic Safety was recognized by Steven Weinberg 
in 1979 jlOl ]. The equation which governs how coupling constants change 
is known as the Exact Renormalization Group Equation and was derived by 
Christof Wetterich in 1992 |1U4] . One can never solve this for the infinity of 
necessary counterterms so what is done instead is to study the way selected 
combinations of counterterms fiow. Exploring these truncations cannot prove 
Asymptotic Safety, but it could disprove the conjecture, and the results so 
far are consistent with the existence of an ultraviolet fixed point. However, it 
has been computationally impossible to include the first really problematic 
counterterm — the one whose divergent coefficient was computed by Goroff 
and Sagnotti [6l [7] . Much recent work on Asymptotic Safety has been done 
by Oliver Lauscher and Martin Renter [105j . Roberto Percacci |106j and 
by Percacci and Daniele Perini |107j . Interested readers should consult the 
review by Lauscher and Renter |1U8] . 

4.4 Loop Quantum Gravity 

Recall that relativists suspect the problems of quantum general relativity 
derive from using perturbation theory. They therefore insist on a very care- 
ful formulation which is not based upon perturbing around any background 
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geometry. That has far-reaching consequences. It means that relativists can 
not accept, as a complete description of quantum gravity, the asymptotic 
scattering states employed by particle theorists, nor can they accept the de- 
cay rates and cross sections we use as observables, nor even our inner product! 
So the full apparatus of quantum mechanics (states, inner product, observ- 
ables) must be constructed from the beginning and with an unprecedented 
level of rigor and generality. It should be noted that we do not possess such 
a general formulation even for quantum electrodynamics, which is the best 
understood and most thoroughly tested quantum field theory. 

Loop quantum gravity is based upon a Hamiltonian for general relativ- 
ity which was developed by Abhay Ashtekar in 1986 |1U9] . Recall that any 
Hamiltonian formalism has "coordinates" (the Q's) and conjugate momenta 
(the P's), and it is usual to consider quantum mechanical states to be func- 
tions of the coordinates. The term "loop" in the name comes from the way 
Ashtekar's coordinate variables are organized into "Wilson loops", a sort 
of line integral (which also involves continuous multiplication of matrices) 
around closed loops. Under certain assumptions one can show that this is 
the unique way of organizing the quantum theory |11U] . 

Any Hamiltonian formalism which involves one or more of the fundamen- 
tal forces will possess constraint equations. Solving these at the required level 
of rigor and generality is a major problem for loop quantum gravity [111] . 
To understand what a constraint equation is, recall the discussion in section 
2.1 concerning the triune nature of fundamental force fields: 

• One part of the field can be changed arbitrarily by a symmetry trans- 
formation which is associated with the conservation law; 

• Another part of the field is completely fixed by its sources; and 

• The final part consists of independent degrees of freedom. 

Solving the constraint equations is what determines the part of the force 
field which is completely fixed by its sources. The constraint equation of 
electrodynamics is Gauss's law (eoV ■ E = p), another equation is fixed 
by current conservation and the remaining two are dynamical. In general 
relativity four of the ten Einstein equations are constraints, four are fixed 
by stress-energy conservation and the remaining two are dynamical. 

In the Hamiltonian formalism of loop quantum gravity one doesn't think 
of the constraint equations as determining field operators, the way I did in 
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section 2, but rather as restricting how state wave functions can depend upon 
the coordinates and also what operators can be observed. It is simple enough 
to work this out perturbatively to a given order, but that sort of perturbative 
solution is exactly what relativists wish to avoid! This makes things tougher 
but perhaps not impossible. The constraint equations of electrodynamics and 
the other fundamental forces have been solved nonperturbatively, as have all 
but one of the quantum gravitational constraints. Unfortunately, there is not 
yet a general solution to the final constraint. 

The absence of explicit states or observables makes it difficult to compute 
much in full loop quantum gravity. In a number of symmetry reduced models 
(see, e.g. |112j ) the program has been completed and the resulting quantum 
theory sheds light on important conceptual issues such as the meaning of 
time and dynamics in background independent physics and the likely fate 
of the most interesting classical singularities in quantum gravity. Work has 
also been done on developing approximation procedures to construct explicit 
observables [113] . 

A measure of the difficulty involved is the effort that went into deriv- 
ing the free graviton propagator [114] . This is a completely trivial exercise 
for particle theorists but it required several years of hard work using loop 
quantum gravity precisely because the concepts involved in its construction 
are highly nontrivial in a background independent approach. At the current 
stage, it also seems to involve a number of choices which indicates that the 
formalism is not yet complete. 

The field of loop quantum gravity has grown to include work on quantum 
cosmology, black hole entropy, the issue of information loss, coupling to mat- 
ter, path integral quantization and the breaking or deformation of Poincare 
invariance. Because this approach is far removed from my own area of ex- 
pertise it is best that interested readers consult recent reviews by the leaders 
of the field. I recommend the articles by Ashtekar and Lewandowski [115| . 
by Smolin [116j . and the books by Rovelli |117] and Thiemann [118] . 

4.5 Causal Dynamical Triangulations 

I have admitted my bias as a particle theorist who loves being able to get 
approximate results using perturbation theory, and I have tried to be honest 
about the possibility that these results might be erroneous, as so many of 
my relativist friends believe. However, exact calculations are unlikely to be 
attainable for quantum gravity, so the most fruitful way of questioning per- 



67 



turbation theory is to develop better approximation techniques. An example 
of work along these lines is the program of Causal Dynamical Triangulations 
by Jan Ambjorn, Jerzy Jurkiewicz and Renate Loll. 

The idea behind Causal Dynamical Triangulations is to numerically sim- 
ulate a formal expression of quantum general relativity in terms of an infinite 
number of integrations, one for each of the ten components of the metric at 
each point in spacetime. Most everyone is familiar with the use of comput- 
ers to numerically evaluate a finite number of integrals. Of course there are 
an infinite number of points in continuum spacetime, and no computer can 
simulate more than a finite number of integrations. So one first discretizes 
the formal representation using a technique devised in 1961 by Tullio Regge 
for approximating classical general relativity without the use of coordinates 
|119j . Then the issue becomes how to take the continuum limit so that one 
plausibly recovers the correct theory. 

Recovering the correct continuum theory is highly nontrivial! In much 
simpler models for which the answer was known, physicists discovered that 
one typically has to make parameters change with the level of discretiza- 
tion in sensitive ways. And sometimes, one must introduce new parameters 
which don't seem to be present in the continuum theory. Of course we 
don't know exactly what the result should be for quantum gravity but we do 
know it should have four spacetime dimensions on macroscopic scales. With 
the most straightforward quantum and computer adaptations of Regge's ap- 
proach |12Ul I121j there does not seem to be any way of taking a continuum 
limit for pure gravity which will have this property |122l I123j , although the 
issue is still open when matter is added [124] . 

To surmount the problem Ambjorn and Loll developed a two dimensional 
variation of Regge's discretization in which time is given a preferred status 
|125] . Subsequent work with Jurkiewicz produced a four dimensional model 
|126j which seems to remain four dimensional as the number of elements 
in the triangulation increases |127j . On large scales the resulting spacetime 
looks like the most symmetric solution of the classical Einstein equations with 
a positive cosmological constant [127] . It is still not clear how to take the 
continuum limit, or if that limit even exists. And it is of course unknown if 
the continuum limit will reproduce Newtonian gravitation at large distances, 
or contain gravitational radiation. One should not expect progress too soon; 
it required decades of labor to attain better than 10% accuracy with nu- 
merical simulations of strong interactions in the low energy regime for which 
perturbation theory fails [128]. A very recent review of Causal Dynamical 
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Triangulations is [129] . 



5 Cosmology 

For years quantum gravity was a realm of speculation in which it was even re- 
spectable to deny the need for a quantum theory of gravity. That has changed 
recently with the advent of the first recognizable quantum gravitational data 
and it has revolutionized the field. These data come from cosmology, and 
many more are likely on the way, so any discussion of quantum gravity must 
include cosmology. I shall begin by introducing the classical metric which 
describes cosmology, and explaining how the Einstein equations constrain 
this metric. Then I discuss why primordial inflation is necessary and the 
simplest class of models which provide the stress-energy to support it. The 
key point of this section is why quantum gravitational effects are enhanced 
during inflation and why the results on cosmological perturbations [22l [23] 
represent the first ever recognized quantum gravitational data. 

5.1 FRW Geometry 

On the largest scales the universe seems to have no special origin or spe- 
cial directions [1301 1131] . These properties are known are homogeneity and 
isotropy, respectively. The universe also seems to be devoid of spatial cur- 
vature [22|. The spacetime geometry consistent with these three features is 
characterized by the following invariant element, 

ds'^ = -c^df + a^{t)dx ■ dx . (194) 

(The acronym "FRW" is formed from the names of three cosmologists: Alex- 
ander Friedman, Howard Percy Robertson and Arthur Geoffrey Walker.) The 
coordinate t represents physical time, the same as it does in fiat space. How- 
ever, the physical distance between x and y is not given by their Euclidean 
norm, \\x — y\\, but rather by a{t)\\x — y\\. Because it converts coordinate 
distance into physical distance a{t) is known as the scale factor. 

Although the scale factor is not directly measurable, three simple observ- 
able quantities can be constructed from it. 
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The redshift z gives the proportional increase in the wavelength of light emit- 
ted at time i and received at the current time, to- (I am ignoring the special 
relativistic Doppler shift.) Redshift is often used to measure cosmo logical 
time, even for epochs from which we detect no radiation. The Hubble pa- 
rameter H{t) gives the rate at which the universe is expanding. It's current 
value is. Ho = (70.5 ± 1.3) ~ 2.3 x lO-^^ Hz [22J. The deceleration 

parameter q{t) is less well measured. Observations of Type la supernovae are 
consistent with a current value of go — —-6 [SI [9]. 

Astronomers infer Hq and go by constructing Hubble Plots. Suppose the 
light from a distant star contains a distinctive absorption line measured at 
the wave length A. If the same line occurs at wave length A^; on Earth, we 
say the star's redshift is z = X/ Xe — 1. One can also measure the flux of 
energy from the star. If we understand the star well enough to know it 
should emit radiation at luminosity C (that is why Type la supernovae are 
important) we can infer its luminosity distance d^, which is the distance the 
star would be at if the scale factor was one, 



A Hubble Plot is a graph of z versus dj^ for many distant stars. 

Stars throughout the universe move with respect to their local environ- 
ments at typical velocities of about 10^'^ the speed of light c. This motion 
gives rise to a special relativistic Doppler shift of Az ~ ±10^'^. If spacetime 
was not expanding, this shift would be the only source of nonzero z, and 
averaging over many stars at the same luminosity distance would give zero 
redshift. That is just what happens for stars within our galaxy. However, 
the luminosity distances of stars in distant galaxies are observed to grow 
approximately linearly with their redshifts, 

c-'HodL = z + ^il~qo)z^ + 0{z'') . (197) 

One really does get Ho from the slope, although inferring qo requires extend- 
ing the plot to z ~ 1, at which point the expansion breaks down and one 
must use the Einstein equations. 

I cannot forbear to comment on the inconvenient minus sign in the defini- 
tion fll95p of q(t). It was placed there because almost all theoretical physicists 
were certain the current universe must be decelerating before the measure- 
ment was done in 1998. (Checking this belief was not a priority; the head of 
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one of the two teams who measured go told me he wasn't even funded at the 
time!) Physicists hke to define the parameters of equations to be positive so 
that one can infer general trends at a glance. Generations of physicists have 
cursed Benjamin Franklin for proposing the arbitrary sign convention that 
lead to electrons — which are the principal charge carrier of our electrical 
industry — having negative charge! But the universe played a trick on us 
and it is actually accelerating, rather than decelerating, so future generations 
will curse the minus sign we clever theorists inserted in the definition fll95p of 
q{t). Aside from general amusement, this tale should serve to caution readers 
about placing too much confidence (which means, any confidence at all) in 
the pronouncements of the scientific establishment on issues that have not 
yet been subjected to experimental and observational scrutiny. 



5.2 Einstein's Equations for FRW 

Homogeneity and isotropy restrict the stress-energy tensor to only an energy 
density p{t) and a pressure pit), 

Too = pit) , = , Tij = pit)gij, (198) 

where i and j are spatial indices. In this geometry Einstein's equations take 
the form. 



3H^-c^A = ^^p, (199) 

-2H-3H^ + c'A = ^p. (200) 

It is usual to redefine the energy density and pressure so as to absorb the 
cosmological constant, 

c^A c^A 
P^P+^ and P^P-^- (201) 

When this is done, the current energy density is, 

Po = ~ 8.5 X 10"^° J/in . (202) 

ottG 

This is the rest mass energy of about 5.7 Hydrogen atoms per cubic meter. 



71 



By differentiating fll99p and then adding 3H times f ll99p plus fl200p . we 
derive a relation between the energy density and pressure known as stress- 
energy conservation, 

p = -3H{p + p). (203) 

If we also assume a constant equation of state, w = p{t)/p{t), then relation 
(I203p can be used to express the energy density in terms of the scale factor, 

p{t)=pi{^) . (204) 

The substitution of flM) in ffT99D gives. 
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a{t) = ai [l + ^(1 + w)H,{t - t,)] . (205) 

The cases of w = +|, 0, — | and —1 correspond to radiation, non- 
relativistic matter, spatial curvature, and vacuum energy (which includes 
the cosmological constant), respectively. 

Radiation =^ p oc a""^ , a(t) oc (Hit) 2 , (206) 

Non-Relativistic Matter =^ p oc , a{t) oc {Hit)i , (207) 

Curvature =^ p oc , a{t) oc Hit , (208) 

Vacuum Energy =^ P oc 1 , a{t) oc e^^* . (209) 

The actual universe seems to be composed of at least three of the pure types, 
so the scale factor does not have a simple time dependence. However, as long 
as each type is separately conserved, we can use (12040 to conclude that, 

/,N Prad Pmat Pcur /oinN 

As the universe expands, the relative importance of the four types changes. 
Whenever a single type predominates, we can infer a{t) from fl205p . This 
different dependence is one reason it makes sense to think of an early universe 
dominated by radiation f l206p . evolving to a universe dominated by non- 
relativistic matter (12070 . It is also how one can understand that the current 
universe seems to be making the transition to domination by vacuum energy 
(1209D. 

Under certain conditions there can be significant energy flows between 
three of the pure types of stress-energy. For example, as the early universe 
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cooled, massive particles changed from behaving like radiation to behaving 
like non-relativistic matter. This change would increase pmat and decrease 
Prad in Eq. fl210p . The parameter that cannot change is that of the spatial 
curvature, pcur- I should not actually have regarded spatial curvature as a 
type of stress-energy, but rather as an additional parameter in the homo- 
geneous and isotropic metric (11941) . 1 avoided this complication because the 
extra terms it gives in the Einstein equations (119911200]) can be subsumed into 
the energy density and pressure, and because the measured value of Pcur/oo 
is consistent with zero \22\ . 



5.3 Primordial Inflation 

The cosmology in which a radiation dominated universe evolves to matter 
domination is a feature of what is known as the Big Bang scenario. Although 
strongly supported by observation [130[ llSlj . the composition of p at the 
start of radiation domination (at which the scale factor is ctrad) does not 
seem natural, 

Prad Prad Pcur /m T\ 

— > Pvac , — > — • (211) 

^rad ^rad '^rad 

It might be expected instead that each of the three terms was comparable, in 
which case the universe would quickly become dominated by vacuum energy. 
There is no accepted explanation for the first inequality of ( 12111) . or for the 
seeming coincidence that Pmat/flo ~ Pvac- However, the second inequality of 
( 12111) finds a natural explanation in the context of Primordial Inflation. 

Inflation is defined as a phase of accelerated expansion, that is, q{t) < 
with H{t) > 0. From the current values of the cosmological parameters one 
can see that the universe is in such a phase now. Recall from section 3.2 that 
explaining why this is happening is one thing a theory of quantum gravity 
might do. However, for now I wish to discuss primordial inflation, which 
is conjectured to have occurred at something like 10~^^ seconds after the 
beginning of the universe with a Hubble parameter 55 orders of magnitude 
larger than it is today. I will be more specific later about what might cause 
inflation but, for now, let us assume it is driven by a vacuum energy p'^^^ 3> 
Pvac (remember it is only pcur which cannot change) and that it begins at 
scale factor Oinf. If the energy densities of curvature and vacuum energy 
are comparable at the beginning of inflation then their ratio by the onset of 
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radiation domination is, 



^ n' 

inf r vac 



Pcur 



~ 1 



Pcur 



( 



Q-rad 



(212) 



In the next subsection I will explain why this number is smaller than about 
10~^^. Inflation makes the other types of stress-energy even smaller, but 
there are mechanisms through which the primordial vacuum energy fl^^ can 
be converted into matter and radiation. This process, which I will not discuss, 
is known as reheating. 

Inflation also explains how the large scale universe became so nearly ho- 
mogeneous and isotropic. This explanation is crucial because gravity makes 
even tiny inhomogeneities grow, and the process has had 13.7 billion years to 
operate. It is believed that the galaxies of today's universe had their origins 
in quantum fluctuations of magnitude Ap/p ^ 10~^, which occurred during 
the last 60 e-foldings of inflation. The imprint of these fluctuations in the 
cosmic microwave background has recently been imaged with unprecedented 
accuracy by the WMAP satellite [22] . They also show up in large scale struc- 
ture surveys [23]. If this view is correct, these observations represent the first 
quantum gravitational data. 

5.4 The Smoothness Problem 

There are many reasons for believing that the very early universe underwent 
a phase of primordial inflation [132]. I will confine myself to reviewing how 
inflation resolves the smoothness problem. This can be summed up in the 
question, why does the large scale universe possess such a simple geometry 



To understand the problem we need to compare the distance light can 
travel from the beginning of the universe to the time of some observable 
event, with the distance it can travel from then to the present. Recall from 
special relativity that light rays travel along paths of zero invariant length, 
ds"^ = 0. From the invariant element (11941) we see that the radial position of 
a light ray obeys, dr = ±cdt/a{t). The minus sign gives the past light-cone 
of the point = (cto, 0), whereas the plus sign gives the future light-cone of 
a point = (ctbeg, 0) at the beginning of the universe. 



HMD? 





(213) 
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We can observe thermal radiation from the time of decouphng {z^ec — 1089) 
whose temperature is isotropic to one part in 10^. This is a much higher 
degree of thermal equilibrium than exists in the air in any office! Unless the 
universe simply began this way — which seems unlikely — this equilibrium 
must have been established by processes acting at or below the speed of light. 
In other words, we must have -Rfuture > -Rpast- 

Suppose that, during the period ti < t < t2, the deceleration parameter 
is constant q{t) = qi- In that case we can obtain explicit expressions for the 
Hubble parameter and the scale factor in terms of their values at t = ti. 



H{t) = . ^ ^ . , , . and a{t) = a^[l + il + q^)H,{t-t^ 
i + [i + qi)Ui[t — ti) L 

(214) 

These expressions permit us to evaluate the fundamental integral involved in 
the past and future light-cones fl213p . 

fdt^ = ^—[l+{l+q,)H,{t-U)]^l'= ^ - ^1 . (215) 

Jti a[t) aiiiigi L J ti q^ La2-H2 cti-Hi J 

Although go is negative, this is a recent event (2; ~ 1) which followed a 
long period of nearly perfect matter domination with q = Much before 
the time of matter-radiation equality {zeq — 3300) the universe was almost 
perfectly radiation-dominated, which corresponds to g = +1. To simplify 
the computation we will ignore the recent phase of acceleration and also the 
transition periods. 



+ Z Z < Zen 

1+^ yz>z ■ (216) 

The cosmic microwave radiation was emitted within about a hundred red- 
shifts of Zdec < Zeq, SO the past light-cone is, 

aoHo [Vl + Vl+Zdec) clqHq 

The future light-cone derives from both epochs and it depends slightly upon 
the beginning redshift, Zheg, 



eg 



aoHo \ Vl+^^dec y/T+zTq] aoi/ol ^ + Zeq l + Zbeg 



(218) 
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One maximizes i^future by taking z^eg — > oo, but it isn't enough, 

lim i?future ^ - -L \ . (219) 

Under the assumption of g = +1 before Zf>,q we are forced to conclude that 
the 2-dimensional surface we can see from the time of decouphng consists of 
about, 

/ ^Rpa^N 2 ^ z^, ^ 2200 , (220) 

^Ri..J [l-i(^)^]^ 

regions which cannot have exchanged even a photon since the beginning of 
time! So how did these 2200 different regions reach equihbrium to one part 
in 10^? 

The problem grows worse the earlier one believes the universe was ho- 
mogeneous. For example, the seven lightest nuclear species were almost all 
produced during Nucleosynthesis at about Zn^^ — 10^ and their isotopic abun- 
dances seem to be uniform over the observed universe. For Nucleosynthesis 
the radii of the past and future light-cones are. 



E^^^^^^ = ^_ <j V - -eg _ V - ' ^eg I ^ ^_ ^222) 

aoHo I 1 + 

Znuc 

So the assumption of g = -|-1 for all time before tgq implies that the number of 
causally disconnected regions at the time of Nucleosynthesis which comprise 
the current universe is about, 

/ -Rpast \2 4=z'^uc inl5 



^ j ~ — ^ ~ 10^' , (223) 

-^future Zgq 

The corresponding numbers for the phase of quark-gluon plasma {z ~ 10^^) 
and the electroweak phase transition (z ~ 10^^) are 10^^ and 10^^, respec- 
tively. One could quibble about the extent to which we know the universe was 
homogeneous at these times but it seems obvious something is very wrong 
with the assumption of positive deceleration throughout cosmic history. 
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This embarrassment resulted from the fact that the upper hmit of integra- 
tion dominates -Rfuture for positive deceleration. Inflation solves the problem 
by positing a very early epoch of negative deceleration. Let us suppose the 
deceleration is g = — 1 for z > Zrad- In that case our simplified model of 
cosmic history (I216p generalizes to, 



\/l + Z < Z < Zeg 

a{t)H{t) ~ aoH^ z^g < z < Zrad . (224) 

Zrad ^ Z <i Zi^f 



rad 



Now let us compute the past and future light-cones of some event at redshift 
z in the radiation dominated period. Of course our previous result of -Rpast — 
2c/aoHo is still valid. However, the initial phase of acceleration makes a 
profound change in the future light-cone. During acceleration it is the lower 
limit of integration which dominates the future light-cone, and the result 
can be made as large as desired simply by increasing the redshift z^f at the 
beginning of inflation. Under the assumption that Zinf ^ Zrad z we get. 



l-future 




Zeg V ^eg Zinf 



Zrad ^rad 



(225) 
(226) 



Inflation explains the smoothness of the large scale universe by supposing 
that everything we now see derived from a region which was small enough 
for causal processes to make it homogeneous and isotropic. Then inflation 
stretched it out and the various pieces slowly came back into contact, after 
inflation, still looking very much alike. In fact the inhomogeneities we now see 
on less than cosmic scales derived from almost 14 billion years of gravitational 
collapse operating on a universe that was smooth to one part in 10^ just after 
inflation. 

The usual assumption that z^ad — 10^^ derives from supposing that ra- 
diation domination commences at a scale of about 10^^ GeV. If we require 
-Rfuture ^ -Rpast then Zinf ^ 10^^. (Many models of inflation vastly exceed 
this minimum.) Note that primordial inflation not only solves the smooth- 
ness problem, it also explains why the spatial curvature (I212p is small, which 
is an observed fact 1221. 
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5.5 Slow Roll Scalar-Driven Inflation 

The case for an early period of accelerated expansion is very strong, and the 
idea was suggested even before the advent of inflation |133j . However, a com- 
pletely satisfactory mechanism for causing accelerated expansion has yet to 
be identified, either for primordial inflation or for the current phase. Guth's 
proposal |2Tj failed to have a satisfactory ending but this was quickly cor- 
rected by the slow roll sca/ar models proposed by Linde |134] and by Albrecht 
and Steinhardt [135j . Although many other classes of models have since then 
been devised, and none are without problems, these are the simplest and I 
will describe them. 

Slow roll scalar models are based upon a hypothetical spin zero (scalar) 
field called the inflaton (p{t,x). (Not having a good candidate for this field 
from fundamental theory is one problem with these models.) The Lagrangian 
for (p(t,x) also involves the metric g^i,(t,x), 



where (lei{gap) is the determinant of the metric and is its matrix inverse. 
Note how the metric, which is the true measure of lengths and times, modifies 
the infinitesimal coordinate volume d^x and the derivatives. This is typical 
of the way it couples to matter in general relativity. 
The stress-energy tensor of the inflaton is, 



One can see that the term involving the scalar potential has the same form 
as the cosmological constant A in the Einstein equations ([9]). By itself, this 
contribution would tend to make the universe accelerate, but one must also 
reckon with the kinetic terms which involve derivatives. In order for inflation 
to start, the scalar potential energy must dominate over its kinetic energy 
throughout a region somewhat larger than light can cross from the beginning 
of the universe. This is not anywhere near as bad as the terrific mismatches 
we found in the previous subsection but it does concern inflationary cos- 
mologists because such an initial condition can only have been an accident. 
Estimating how unlikely this accident is depends upon what one believes 
about how the universe began, and also involves tricky questions about how 




(228) 
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to compute probabilities. However, all estimates give very small numbers (I 
have heard 10~^^°) for the chances of it happening. 

Rather than start from an inhomogeneous configuration I will simply as- 
sume the initial condition was homogeneous and isotropic. I will also assume 
spatial flatness, which would in any case result, approximately, from a long 
phase of inflation. This means the metric takes the FRW form fll94p and that 
the scalar depends only upon time, ip{t,x) — > v'o(^)- The nontrivial Einstein 
equations are. 



3H^ - c^A 



-2H - 3H^ + c^A 



-2^ 



(229) 
(230) 



The deceleration parameter (times H^) can be computed from a linear com- 
bination of these equations. 



3c2 



00 
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So the condition for accelerated expansion is, 

+ ^ > ^ • 
The equation for the homogeneous scalar ^oif) is, 
(po + 3H^o + cV'(<^o) = , 



(231) 



(232) 



(233) 



where V'{(p) = dV/dip. The simplest model of scalar-driven inflation, and 
one which is still consistent with all data [22], consists of a constant plus a 
quadratic term. 



l/mc\2 



mc\2 



in. 



(234) 



Substituting ^ 
dilator, 



into 



gives the equation for a damped harmonic os- 



(fo + 3Hlpo + (^-^ j ipQ = 



(235) 



The term 3HipQ in equations fl233p and fl235p is known as Hubble friction. For 
sufficiently large H{t) one can see that it makes the scalar over-damped so 
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that it slowly rolls down the quadratic potential fl234p . That makes the scalar 
kinetic energy small, which will enforce the condition fl232p for accelerated 
expansion provided two more conditions hold. 

The first of these extra conditions is that the constant Vq must be chosen 
to almost cancel the cosmological constant, 

Vo = -|^+Pvac, (236) 

where pvac ~ 6 x 10~^° J/m^ is the currently observed value of the vacuum 
energy [HI [9] . Of course the absence of any explanation for this choice is the 
problem of the cosmological constant that I discussed in section 3.2. The 
second extra condition is that the scalar must start with a large enough 
initial value, and a small enough initial time derivative, so that (12321) holds 
initially. The absence of any explanation for this is part of what is known as 
the initial condition problem. 

If one assumes these two conditions then Hubble friction causes the scalar 
to slowly roll down its potential. As it rolls, the Hubble parameter grows 
smaller, which makes Hubble friction less effective. Eventually the system 
becomes under-damped and begins oscillating. If the scalar is coupled to 
other fields (the quantum corrections from which must be prevented from 
distorting its potential too much — another problem with this class of mod- 
els!) then this phase of oscillations can result in hot, radiation-dominated 
universe |136j . By choosing the initial value of the scalar sufficiently large 
one can make the phase of inflation last arbitrarily long, although making 
it too large can force the system to a regime known as eternal inflation in 
which quantum fluctuations actually push the scalar up its potential. 

I have described scalar slow-roll models to show that primordial inflation 
can be supported in a relatively simple way, even if this requires a number 
of arbitrary assumptions. There are other models, none of which is without 
problems. Although I very much doubt any of these models is correct, I 
consider the case for an early phase of accelerated expansion to be to be 
overwhelming. Finding a realistic model which causes this phase is one of 
the things I hope quantum gravity can do. 

5.6 The Strength of Quantum Effects during Inflation 

Leonard Parker was the first to give a quantitative assessment of how space- 
time expansion affects quantum processes [137] but one can understand some 
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of the things he found in a quahtative way. In particular, I will try to explain 
three crucial facts: 



• The expansion of spacetime strengthens quantum effects; 

• This strengthening is greatest during accelerated expansion; 

• Just as in fiat space, the largest quantum effects derive from the lightest 
particles provided they can avoid being driven to zero amplitude by the 
expansion of the universe. 

Recall from section 2 that quantum fields obey exactly same equations 
as their classical counterparts, so one can understand quantum effects as the 
classical response to the 0-point motion which is required by the uncertainty 
principle. For example, we saw that vacuum polarization worlis the same way 
as classical polarization in a medium if one simply accepts that each mode 
of the electron field has 0-point motion. The amount of 0-point motion a 
field experiences is controlled by its free field mode functions, and whatever 
increases the amount of 0-point motion will strengthen quantum effects. For 
example, we observed from the oscillatory factors of e*^*/^ in the electron 
mode functions fHTl) that the combination of an electron of wave vector k 
and a positron of wave vector p — k can only remain coherent for a time At 
of about, 



At ^ — ^ where E(k) = Jm^c'^ + h^cHkP . (237) 

E{k)+E{p-k) 

That is the only quantum mechanics one needs. The remainder of the compu- 
tation consists of using the Lorentz force law to find the classical polarization 
induced by such a pair being acted upon for time At by a static electric field 

E{k) + E{p-k) [E{k) + E{p-k)f 

Summing over all modes k gives (up to a factor of |) the actual first order 
result fl99m00l) for the vacuum polarization due to a static field, 

J i27r)^ [E{k) + E{p-k)f 
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The reason spacetime expansion strengthens quantum effects is that phys- 
ical wave numbers redshift. Because the FRW geometry is invariant under 
spatial translations, particles are still labeled by constant wave vectors k. 
However, because = 27r/A is the inverse of a coordinate wavelength, 
not the physical wavelength a{t)X, it is really the combination k/a{t) which 
tends to enter physical expressions. The actual mode functions of a massive 
particle are complicated but it is not a bad approximation to think of the 
term E{k)At/h in the phase of a mode function generalizing to, 

E{k)At ft+^iE{t',k) 



h 



I dt' \' ' where Eit, k) = Jm'^c^ + h^c^\k\\ya^(t) . 
Jt n 

(240) 

The expansion of a{t') always makes the accumulated phase smaller than it 
would have been for constant scale factor, so the mode can persist longer and 
we see why the expansion of spacetime strengthens quantum effects. 

Just as in flat space (that is, a{t) = 1) particles with the smallest masses 
remain coherent longest, which is why almost all vacuum polarization comes 
from electrons and positrons, even though there are many other charged 
particle fields. Setting m = in expression (12401) gives the same integral 
(12131) we saw in section 5.4, 

lim f'dfE^S. = c\\k\\ X f'df^ . (241) 



m^oJt h Jt a{t' 

Now recall the key distinction between accelerated expansion and decelera- 
tion: 

• For deceleration (g(t) > 0) the integral (I24ip grows without bound as 
At increases; but 

• For acceleration (g(t) < 0) the integral (12411) approaches a constant as 
At goes to infinity. 

The first fact means that modes in a decelerating universe must eventually 
become decoherent, even though they persist longer than for a static uni- 
verse. The second fact means that the 0-point motion of a sufficiently long 
wavelength mode can persist forever during inflation. 

This is not quite the end of the story because almost all particles develop a 
symmetry known as conformal invariance when they become massless. This 
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symmetry causes their mode functions to fall off like powers of the scale 
factor a{t). For example, if we set the electron mass to zero in fl41l) and then 
account for the FRW geometry, the electron mode function becomes. 



Bi(t, X] k, s) 



-ikct+ik-x 



"'uiik,s), (242) 

i(t)r\llLe-'''^Io'^''M'')+'^-^uAk,s) , (243) 
-I V 2k 



where the spinor wave function Ui{k, s) is unchanged from its flat space value 
( 1371) with m = 0. Of course this decreasing amplitude tends to suppress 
0-point motion, which weakens quantum effects. 

To complete the third point on my list I need to show that there are 
massless particles whose mode functions avoid being driven to zero. It turns 
out there are two such particles: 

• Scalars like the inflaton (12271) but with zero potential V{ip); and 

• Gravitons. 

It also turns out that the mode functions of gravitons obey the same equations 
as those of the massless inflaton |138j . so I will specialize to the latter. 

We can understand the physics of massless infiatons by specializing the 
Lagrangian (12271) to the FRW geometry (11941) but still considering ip(t,x) 
to be an arbitrary function of space and time, and then using Parseval's 
theorem, 



1 /ip{t,x)\2 



2 V c 
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(245) 



It follows that the field at each Fourier wave vector k behaves as an indepen- 
dent harmonic oscillator with time-dependent mass and frequency. 



m(t) = mQa'^it) 



and 



u{t) 



kc 

a(t) ■ 



(246) 



83 



If we use q{t) to represent the position of such an oscillator its Lagrangian 
and energy are, 



m(t) 



m{t) 



(247) 

All the usual theorems of quantum mechanics apply to this system. In par- 
ticular, the state with minimum energy at any fixed time t must be, 

hkc 



^mia (t) 



2a{t) ■ 



(248) 



However, the time dependence of the scale factor means that the minimum 
energy state at one time is not the same at other times! The usual choice for 
the ground state under these circumstances is called Bunch-Davies vacuum 
|139] and it corresponds to the state which was minimum energy in the distant 
past. 

Let us find an expression for q{t), analogous to ( |25l) . in terms of the raising 
an lowering operators and a for Bunch-Davies vacuum \Q). It takes the 
form. 



a, a 



1 



and a 



n)= . (249) 



q{t) = ae{t) + a'^e*{t) where 
The mode functions obey, 

/ AjC \ 2 ill/ 

e(t) + 3H(t)6(t) + (— - ) 6(t) = and e(t)e*(t) - i(t)6*(t) = — . 

^a[tj^ moa'*(t) 

(250) 

These equations are too difficult to solve for general scale factor a(t) but the 
solution for H{t) = Hj (which corresponds to the q{t) = —1 paradigm we 
employed for primordial inflation in section 5.4) is. 
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Now evaluate the expectation value of the energy operator in Bunch-Davies 
vacuum, first for arbitrary a{t) and then for (12511) . 
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Recall that this system has the energy eigenstates of a harmonic oscillator 
with energies (A^ + ^)huj{t) at any fixed instant in time. So it is completely 
valid to regard the final expression in (12531) as giving the occupation number, 

m-l^r. (254, 

Hence we see that Bunch-Davies vacuum starts out empty. Expression (12511) 
reveals that, for kc ^ Hia{t) the mode function oscillates and falls off like 
l/a{t), which is how a massless, conformally invariant scalar behaves. As 
the scale factor grows, the occupation number (I254p increases, and both 
the oscillations in the mode function and the decrease of its amplitude slow 
down. A key event is first horizon crossing when kc = H{t)a{t), at which 
point the occupation number becomes order one and the mode function be- 
gins approaching a constant. After that the occupation number becomes 
exponentially large, as does the amount of energy in this single mode. 

One might worry that the behavior I have just sketched is very special 
to the case of q(t) = —1 but that is not true. As long as the universe 
is accelerating the product H(t)a(t) grows, and one can see from equation 
(12501) that modes which start with kc ^ H{t)a{t) oscillate and fall off. 



hi r /■* 1 n 

kc:^H(t)a(t) =^ ^(i) ~ J ; — exp -ike dt' —- . (255) 

They are also drawn towards first horizon crossing. If infiation persists long 
enough for them to reach it, the (fcc/a)^ term of (12501) drops out and one can 
see that the mode function approaches a constant. So the behavior I found 
for (I25ip is actually generic to any inflating geometry. 

Of course this is why massless infiatons and gravitons show enhanced 
quantum effects during inflation. Before closing I should comment on the 
sheer wonder of what we found in expression (12531) . This represents the 0- 
point energy of one single mode! Of course one must divide it over the vast 
3-volume of the inflating universe, but summing over the modes which have 
experienced flrst horizon crossing gives a small, macroscopic effect. Another 
important comment is that this result derives from the long wavelength regime 
in which perturhative quantum general relativity should he valid, even if quan- 
tum gravity is described, on the fundamental level, by some other theory, or 
if perturbative methods do not give the correct asymptotic series. My flnal 
comment is that there is no natural mechanism for keeping the inflaton mass- 
less, and if it develops a large mass that will suppress quantum effects during 
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inflation the same way it would in flat space. By contrast, nothing needs to 
be done to keep the graviton massless, so this long wavelength regime during 
inflation is a natural place to look for quantum gravitational effects. 



5.7 Quantum Gravitational Data 

I have already discussed scalar-driven inflation and why one expects enhanced 
quantum effects during this epoch. In this subsection I will sketch the theory 
behind the first quantum gravitational observables ever measured, which are 
cosmological perturbations. The original work on tensor perturbations was 
done by Starobinsky |14U] : Mukhanov and Chibisov did the first calculation 
of scalar perturbations |141j . For more details I recommend the excellent 
recent text by Mukhanov |136j . 

Cosmological perturbations are spacetime dependent fluctuations of the 
full scalar and metric fields around the spatially homogeneous background 
values described in section 5.5, 

ip{t,x) = ipo{t) + Sv{t,x) , (256) 
9t,u{t,x) = g^^{t) + hf,u{t,x) . (257) 

Here g^i,{t) stands for the FRW metric (11941) . 

gooit) = -l , goiit)=0 , g,^{t)=a^mj- (258) 

It is typical to express the graviton using just scalar and tensor fields because 
the linearized Einstein equations cause the vector fields to vanish, 

hoo{t,x) = -2(P{t,x), (259) 

hoi{t,x) = -c^B{t,x), (260) 

hij{t,x) = -2a\t)^{t,x)6,,-2c^^^E{t,x)W{t)hf^{t,x) . (261) 

Here the dynamical graviton field hjj^ is both transverse and traceless, 

E^^5^(^'^^) = = E^r(t,f). (262) 
i=i '^^ 1=1 

Dynamical gravitons are invariant under linearized coordinate transfor- 
mations and their spatial Fourier transforms obey the same equation 
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massless inflaton, 



dt a^it) 



(263) 



We have already seen that for kc ^ H(t)a{t) the solutions are oscillatory 
with amplitudes that fall off like l/a{t). Long after first horizon crossing, in 
the regime for which kc ^ H(t)a(t) the term proportional to {kc/aY drops 
out and the solution goes to a linear combination of a constant and the falling 
indefinite integral, 

1 



dt' 



(264) 



The following combinations of the scalar fields are invariant under lin- 
earized coordinate transformations, 



$(t,f) = (j)(t,x)- 



d 



dt 



Bit, x)-E{t, x) + 2H{t)E{t, x) 
m{t,x) = ^{t,x) + H{t)[B{t,x)-E{t,x)+2H{t)E{t,x 
E{t,x) = 6(p{t,x) - ipoit)\B{t,x)-E{t,x)+2H{t)E{t,x 



(265) 
(266) 
(267) 



The linearized Einstein equations for indices fi = i and v = j imply = $, 
whereupon the linearized /i = 0, = and /i = 0, = equations become. 



6H— + 6H^ — 



dt 



SttG 



2_d_ 

a dx'^ 



1^ 

a dx 



- 0oS - cV(v9o)^ 
SirG . , 



(268) 
(269) 



Of course equation f l269p relates the scalar perturbation to the scalar part of 
the metric perturbation. 



E{t,x) 



4nG 



^{t,x)+H{t)^(t,x) 



(270) 



Substituting this in (12680 . taking the spatial Fourier transform, and mak- 
ing some simplifications eventually results in what has been termed the 
Mukhanov Equation for $(t, k)/ifQ{t), 



2^2 



k^c 



dt a^it) 



e{t)+H{t)d{t) 

e{t) 



' Ht,ky 







(271) 
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where the function 9{t) is, 



The solution of fl27ip is quahtatively similar to that of the massless in- 
flaton. In the regime kc ^ H{t)a{t) the field oscillates and its amplitude 
falls off. Long after first horizon crossing we can again drop the term pro- 
portional to ikcjaf' and the solution becomes a linear combination of Q{t) 
and the indefinite integral, 

It is this second solution which dominates long after first horizon crossing. 

For both the scalar perturbations of /c) and the tensor perturbations 
of hfj'^it, k) we can get good approximate solutions for the mode functions in 
the regime kc ^ H{t)a{t). Canonical quantization fixes their normalization. 
Long after first horizon crossing we can again get good solutions, one of 
which approaches a constant with the other falling. The strength of the 
each perturbation is quantified by its power spectrum. If the spatial Fourier 
transform of some quantum field F{t, x) approaches a constant we define its 
power spectrum Vpik) as. 



n 



\Fit,k)F\t,k')\n) ^ ^ X {2n)'S-\k-k') . (274) 



It is quite a challenge to connect the early, oscillatory regime — for which 
the normalization is known — to the late time regime, long after first horizon 
crossing [142] . Approximate results can be obtained by crudely matching 
the early and late time solutions at the time tk for which wave number k 
experiences first horizon crossing, 

kc = H{tk)a{tk) . (275) 

These mode-matching results are, 

V.{k) ^ ^fM, (276) 
Vh{k) ~ GH\h). (277) 



88 



It should be noted that fl276H277p are the primordial power spectra. What 
is actually measured is the result of complicated but quite well understood 
physics that occurs after inflation, during the subsequent epochs of radia- 
tion domination and matter domination. This is included through a known 
transfer function, so the challenge for fundamental theory is to compute the 
primordial power spectra. It should also be noted that the primordial power 
spectra are almost independent of k because H{t) is nearly constant during 
inflation. The data are therefore typically organized into scalar and tensor 
spectral indices, 

reported at a fiducial wave number. Lately there has been an effort to report 
the first derivative of the scalar spectral index as well [22]. 

One can see from expressions (1276112771) that the scalar power spectrum 
is enhanced by a factor of 1/(1 + q), relative to the tensor one. Because 
q is very near —1 during inflation this is a large enhancement, and that 
is why the tensor signal has so far not been detected. Getting a nonzero 
measurement for Vh{k) is very important because it would tell us the scale 
of primordial inflation. It would also be the first proof that gravitons exist 
and are quantized. 

I want to emphasize that primordial perturbations are a quantum gravi- 
tational effect. Many quantum gravity experts dismiss them because we have 
so far not detected the tensor signal from gravitons, but this is wrong-headed. 
There is no problem with pure gravitons at the lowest order in perturbation 
theory; the problem at lowest order comes from matter, and the scalar per- 
turbation signal probes precisely this effect at an energy scale potentially as 
high as 10^^ GeV. If I had been told only one of the two perturbation spectra 
could be measured and asked to choose which one, I would have picked the 
scalar. This is priceless information, although it would be better if we has a 
unique theory of inflation which made specific predictions. As it is, one can 
accommodate almost any data by changing the model of inflation, and none 
of the models proposed so far is very compelling. 

I should like to close with two more comments, one about experiment 
and the other about theory. First, there are a lot more data on the way. 



89 



The European Space Agency has launched the Planck satellite which will 
try to resolve the primordial tensor spectrum. And this is just one of many 
experiments. The most exciting idea to me personally is the proposal to use 
the 21 centimeter line out to huge redshifts such as ^ ~ 50 |143j . Foregrounds 
will be an enormous problem, and it will require years of labor, but there is 
potentially enough data present to determine the tensor-to-scalar ratio r to 
one part in 10^, compared with the current bound of r < .22 [22j! 

My comment about theory is that we are just beginning to understand 
how to compute quantum corrections during inflation. Nothing like the 
asymptotic scattering theory of fiat space quantum field theory yet exists 
for this environment. Results such as (1276112771) receive quantum corrections 
which Weinberg has studied |144j . He found a peculiar thing: although these 
corrections are suppressed by a very small factor of GH^ < 10~^^, in addition 
to the factor of GH^ already present in the lowest order result (I276p . they 
contain time dependent enhancement factors that grow like \n[a{t) / a{tk)], 
where is the time at which the mode experienced first horizon crossing. 
Nick Tsamis and I have found similar factors in quantum gravitational cor- 
rections to the metric and to the graviton self-energy |145j . and Shun-Pei 
Miao and I have found them in quantum gravitational corrections to fermion 
mode functions [146] . For the modes whose spatial variation we can resolve 
today there have been at most about 120 e-foldings, so these infrared loga- 
rithms represent a terrific enhancement, but not enough to make corrections 
observable. (But they might become observable if the 21-centimeter obser- 
vations fulfill their promise!) However, this is just because one insists on 
being able to resolve the spatial variation. If one studies something which 
is spatially constant, like quantum corrections to the vacuum energy, then 
there can be infrared logarithms from modes which experienced first horizon 
crossing early during a very long period of inflation, and there seems no limit 
on the size of the effect they might give. Which is interest, because it just so 
happens that there is this little problem understanding the vacuum energy . 



6 Conclusions 

This article began with a list of seven questions. I will devote graph to 

each question and the answers that have been developed. Then I will make 
a few additional points and comment on the future. 
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What is the distinction between classical physics and quantum physics 
that makes general relativity give such a wonderful classical theory of gravity 
and such a problematic quantum one? There is no difference between the 
equations of motion for Heisenberg field operators and those of the corre- 
sponding classical theory. Nor is there any distinction in what it means to 
solve those equations; in both cases the general solution consists of expressing 
the dynamical variable at any time in terms of its initial values. These initial 
values are the fundamental degrees of freedom of physics, and it is usual to 
label them by their Fourier wave number k. The key distinction between 
classical and quantum is what the initial values are: in classical physics they 
are numbers and each of them can be set to any value; in quantum physics 

— * 

each mode k contains one or more pairs of non-commuting operators which 
obey the uncertainty principle. One consequence of this is that no mode can 
have less than a minimum 0-point energy. When we say that general rela- 
tivity gives a superb classical theory of gravity, we mean that its results are 
indistinguishable from nature when we set all the large \\k\\ modes to zero. 
There is no problem doing that classically, but it is not permitted in quantum 
mechanics. It is the influence of these high modes which makes general 
relativity so problematic as a quantum theory. 

Why do we have to quantize gravity? Because part of any force field 
is entirely determined by its sources, and the matter fields which source 
gravity are indisputably quantum mechanical, whether or not gravitons are 
quantized or even exist. It used to be argued that we could take the metric 
field to be sourced by the expectation value of the matter stress-energy tensor 
but that view is not tenable under the simplest interpretation of the observed 
anisotropies in the cosmic ray microwave background. If the predicted tensor 
component of these anisotropics can be imaged there will be direct evidence 
for the existence and quantum nature of gravitons as well. 

Why do quantum field theories have divergences? Because continuum 
field theories have an infinite number of modes, and the 0-point motion of 
each one of them contributes a little to typical quantum effects. Spacetime 
may well be discrete on some level but the expansion of the universe by the 
staggering factor of e^^° ~ 10^^ (if we accept the reality of primordial infla- 
tion) means that this discreteness cannot be responsible for keeping quantum 
gravitational effects so small. 

Why are the divergences of quantum general relativity worse than those 
of the other forces? Because the other forces couple to charges which are the 
same for all modes, whereas gravity couples to stress-energy, which grows 
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with the wave number. This means that first order quantum corrections to 
the gravitational field equations produce divergences not just on terms which 
have two derivatives of the metric or zero derivatives of it, but also on terms 
which contain four derivatives of the metric. Two terms of this type are 
possible, the "i?^ counterterm" and the "C^ counterterm." 

How bad is the problem? Adding the and counterterms to the 
gravitational field equations would allow the divergences of quantum gravity 
to be renormalized to all orders, the same way as with other forces [5] . How- 
ever, increasing the number of derivatives in a field equation introduces new 
degrees of freedom. The new degree of freedom associated with the coun- 
terterm is a positive energy particle with spin zero, which poses no essential 
problem. Unfortunately, the new degrees of freedom associated with the 
counterterm comprise a negative energy particle with spin two, which would 
make the universe blow up instantly. The counterterm is necessary at first 
order in gravity plus scalar particles such as the Higgs [2]. It is also needed 
for gravity plus electromagnetism |3] and for gravity plus the weak or strong 
interactions [1]. It is not necessary at first order for pure gravity in four 
spacetime dimensions but this theory requires an equally unacceptable 
counterterm at second order [HIT]. Hence we must either add unacceptable 
counterterms, which gives a finite theory that is virulently unstable, or else 
low order perturbation theory makes divergent predictions. 

What are the main schools of thought about quantizing gravity and why 
do they disagree ? The problem with perturbative quantum general relativity 
arises from a confiict between four things: continuum field theory, quantum 
mechanics, general relativity and perturbation theory. Because the first two 
items on this list are not likely to be at fault the schools of thought on quan- 
tum gravity differ on which of the last two they suspect. Particle theorists 
come from a long tradition of quickly exploiting perturbation theory to get 
results and then rejecting models which fail to measure up. This brought 
great success with the Standard Model, so it seems reasonable to particle 
theorists that they should trust perturbation theory and reject general rel- 
ativity. Relativists come from a long tradition in which general relativity 
was many times alleged to have problems that always disappeared when a 
sufficiently careful analysis was made. So it makes sense to relativists that 
they should reject perturbation theory and instead focus on a painstakingly 
rigorous formulation of quantum general relativity. 

What would we do with the theory of quantum gravity if we had it? The 
strength of quantum gravity corrections from a mode of energy E seems to be 
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roughly GE"^ /he' ~ (£'/10^^ GeV)^. Although we cannot access interesting 
energies in the laboratory, nature reaches these scales in four cases: the initial 
singularity, the final stages of black hole collapse, the final stages of black 
hole evaporation, and during primordial infiation. Ideas about the first three 
are still speculative but the simplest interpretation of current data is that 
the primordial perturbations in the gravitational potential of the Universe 
derive from quantum matter fluctuations during primordial inflation. 

Establishing phenomenological contact with a weak interaction typically 
involves exploiting its special properties. The unique properties of gravity 
are: 

1. One of the gravitational parameters is the cosmological constant A; 

2. It determines the maximum speed at which signals can propagate; 

3. Gravitons have zero mass without being driven to zero amplitude by 
the expansion of the universe; and 

4. The gravitational interaction energy is negative. 

Physicists suspect that a successful theory of quantum gravity will explain 
the enormous disparity between the measured value of the cosmological con- 
stant fll64p and the natural scales of fundamental theory. The use of flrst 
order perturbation theory in the context of point 2 leads to flnite predictions 
for the blurring of images, fluctuations in luminosity and a broadening of 
spectral lines. Some of these effects may be observable in the not-too-distant 
future. Point 3 suggests that inflationary cosmology is a natural venue to 
search for quantum gravitational effects. And point 4 is the basis for the old 
dream of being able to compute the masses of fundamental particle from the 
interactions of their own force flelds. 

An interesting and possibly signiflcant fact strikes one about all four of 
the current approaches to quantizing gravity reviewed in section 4. Each of 
them involves negative energy in some form: 

• The particle theorists' dreams concerning superstrings and on-shell 
finiteness exploit the negative 0-point energy of fermionic superpart- 
ners; 

• The higher counterterms allowed in asymptotic safety would induce 
negative energy degrees of freedom if they were treated nonperturba- 
tively; and 
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• The relativists' dream that quantum general relativity will regulate its 
own ultraviolet divergences relies upon negative gravitational interac- 
tion energies. 

Perhaps this is more than a curiosity. 

My answer to the question posed by the title of this article is that we 
are very far from having a complete quantum theory of gravity. A measure 
of the reliability of current thought (including my own) is this quotation 
from a renowned string theorist in reaction to the initial observations which 
indicated the universe is accelerating [S]: 

I'm sure the data is wrong because string theory predicts a negative 
cosmological constant. 

I recount these words not because string theorists are bad physicists but 
rather because they are among the best our species has ever produced. That 
even they failed points up the folly of trying to guess natural law with nothing 
more to go on than mathematics and aesthetics. If that was not obvious two 
and a half decades ago it is surely beyond dispute now. 

Neither the nature of the problem, nor the prescription for its inevitable 
solution are unique in man's long struggle to understand the universe. The 
historian Colin McEvedy had this to say about the intellectual stagnation of 
late Roman civilization |147j : 

Speculation ran way beyond the testable and dwindled into meta- 
physics; technology remained tradition-bound and sluggish. Only 
the evolution of a scientific stance — one foot inside the bound- 
ary of the known, the other just outside — could have guaranteed 
the superiority, and consequently the integrity, of Mediterranean 
society, and the world was still too young for that. 

The good news for quantum gravity is that we shall not have to endure 
centuries of darkness until a more powerful mode of thought emerges from the 
ashes of our failures. The process of achieving a scientific stance is underway; 
the first data have been taken, and many more are coming. Understanding 
what they have to teach us will likely be a long and painful process, and I 
expect that most of what we currently believe will need to be abandoned. 
But the outcome cannot be in doubt and those who finally win it for us will 
write in the book of human history. 
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